1 Introduction



Two-way relaying is a promising technique for improving the performance of wireless networks because it
dramatically enhances spectral efficiency compared with traditional one-way relaying. The technique was
established by the seminal works [1] [2] [3] [4], which introduce different two-way relaying schemes. Generally,
these schemes fall into two categories: three-step schemes and two-step schemes. In a three-step scheme,
also known as decode-and-forward (DF) relaying, the two end nodes of the two-way relay channel transmit their
packets sequentially in the first two time slots; the relay node then combines the two received packets with a
bitwise XOR operation and broadcasts the coded packet to both end nodes in the next slot. In a two-step scheme,
also known as amplify-and-forward (AF) or denoise-and-forward (DNF) relaying depending on the operation
applied at the relay node, the two end nodes send their packets concurrently to the relay node in the first time
slot. In the second time slot, the relay node either directly broadcasts the received waveform (AF relaying) or
performs physical layer network coding (PLNC) on the received signals and broadcasts the coded packet (DNF
relaying) to the two end nodes. As shown in [3], two-step schemes are more efficient than three-step schemes,
especially in the high-SNR regime. Hence, in this paper we consider only two-step two-way relaying schemes.

Motivated by these performance gains, many papers investigate how to apply two-way relaying in wireless
networks. Most of them focus on networks with regular topologies, such as the star topology [5], the layered
topology [6], the two-tier topology [7], a routing path in a multihop network [8], and the source-relays-source
topology [9] [10]. For general ad hoc networks, [11] provides a solution based on transmission scheduling.
However, the optimal scheduling in that paper turns out to be NP-hard, and a similar conclusion is confirmed
in [12]. Scheduling-based solutions are therefore too complicated for general ad hoc networks and are not
practical in realistic systems. To apply two-way relaying in general networks in a more practical and more
scalable way, random access schemes should be taken into consideration.

Recently, a few papers have addressed the design of random access MAC protocols that support two-way
relaying. Majid Khabbazian [13] presents a MAC design for analog network coding (similar to AF relaying).
However, the solution remains at a theoretical level and is far from a practical design. In addition, Shiqiang
Wang [14] proposes a distributed MAC protocol that enables the application of PLNC in the network. However,
the protocol suffers from some defects. First, it has no mechanism to guarantee a performance gain when
there are no bidirectional data flows: when the two end nodes do not both have packets for each other, the
throughput degrades to the level obtained without two-way relaying. This limits the usage of the protocol.
Second, the transmission is initiated by the relay node instead of by the transmitters of the data. Nodes
therefore have to provide their queue information to neighbor nodes, which degrades performance because of
the overhead and the increased mean packet delay. To the best of our knowledge, no better solution has been
published so far. Hence a more practical and more efficient random access MAC protocol that supports two-way
relaying is highly needed.

To support a random access MAC protocol, the physical layer implementation of two-way relaying should be
revised. Some requirements of two-way relaying, such as symbol synchronization and frame alignment,
are difficult to meet under a random access protocol and hence should be removed. Some previous works [2]
[15] [16] [17] [18] [19] remove or relax these requirements and make two-way relaying more practical.
However, each of these solutions has its own shortcomings and is insufficient to serve as a physical layer
scheme for a random access MAC protocol. In [2], an analog network coding scheme is proposed without any
synchronization requirement. However, the scheme is designed only for MSK modulation and cannot support
QAM modulation, which is more widely used. The OFDM-based solution proposed in [15] only tolerates packet
asynchrony up to the length of the cyclic prefix of the OFDM symbol, which limits its usage in many scenarios.
The schemes based on linear convolution coding [16] [17] suffer from a similar issue: they tolerate symbol-level
misalignment, but frame synchronization is still required. The asynchronous two-way relaying schemes based
on PLNC proposed in [18] [19] have the problem that the coding scheme is sensitive to the channel gains and
the modulation schemes adopted by the two end nodes, as revealed in [20]. The author of [20] also shows that
the issue is further complicated when a large constellation size is used. Because of this inflexibility, these
schemes are not suitable for random access networks, especially mobile ones. Therefore, to leave more freedom
for the design of a widely applicable random access MAC protocol, it is necessary to design a new physical
layer scheme for two-way relaying that is flexible, fully asynchronous and applicable to multiple modulation
schemes.

In this paper, we propose not only a random access MAC protocol that supports two-way relaying but also
a practical physical layer scheme for two-way relaying that facilitates the MAC design. To the best of our
knowledge, we are the first to provide an integrated design, covering both the physical layer and the MAC
layer, for applying two-way relaying in general ad hoc networks. This integrated design delivers a practical
and high-performance solution. Specifically, we first present a new physical layer decoding algorithm for the
end nodes of the two-way relay channel. The algorithm combines several techniques, namely oversampling, joint
channel estimation and waveform recovery. This decoding algorithm at the end nodes and the amplify-and-forward
operation at the relay node together compose our physical layer scheme. The scheme has several advantages.
First, it does not require any synchronization. Second, it is applicable to any linear modulation scheme.
Third, all the information required for physical layer operations, e.g. the channel coefficients, is locally
obtainable. Hence our physical layer scheme adds little extra burden to the MAC protocol. We then propose a
random access MAC protocol, TREAN (Two-way Relaying Enhanced Ad-hoc Network). It performs RTS/CTS-like
queries to build the cooperative configuration for two-way relaying. After that, the new physical layer scheme
is applied to conduct the two-way relay transmission with the least coordination. In addition, different modes
of TREAN are designed to enlarge its application scope. The basic mode is simple, flexible and requires the
least management. It works well in networks with bidirectional data flows and is also suitable for mobile
networks. The extended modes are designed to deliver high performance even when bidirectional data flows are
absent. Moreover, we provide an accurate analysis and an approximate derivation of the saturation throughput
for small-scale and large-scale networks, respectively. These theoretical results can serve as tools for
performance evaluation or as guidelines for practical designs. To validate our design and analysis, we perform
simulations in various settings. The results show that the TREAN protocol can significantly enhance the
throughput of the network. The theoretical analysis is also verified by the close match between the analytic
results and the simulation results.

The rest of this paper is organized as follows: Section II introduces our physical layer scheme; Section III
describes the details of the TREAN protocol; Section IV provides the throughput analysis when the TREAN
protocol is adopted in the network; Section V evaluates the performance of our integrated design; Section VI
concludes the paper.

2 New Physical Layer Decoding Scheme for Two-way Relaying

In our physical layer scheme for two-way relaying, the two end nodes transmit their packets to the relay node
without any synchronization. After receiving the superposed packets, the relay node simply amplifies and
forwards the received signals to the end nodes. The two end nodes then extract their desired packets from the
superposed waveform broadcast by the relay node with our new decoding algorithm. In this section, we present
this new decoding algorithm. Its basic idea is to exploit oversampling. First, the superposed waveform
broadcast by the relay node is sampled by the receiver at a frequency higher than the symbol rate. Then, we
subtract the components of the known packet from these samples. Finally, we recover the waveform of the
unknown packet from the processed samples according to the Shannon sampling theorem and decode the obtained
waveform with a general decoding procedure. The details of our new physical layer scheme are explained as
follows.
Sampling and packet detection First of all, we show the packet layout for our physical layer in Figure 1.
Similar to the design in [2], we divide the training sequence field into two parts and add them at the
beginning and the end of the packet as the preamble sequence and the postamble sequence, respectively. The
preamble is an $L_p$-bit pilot sequence $\{p_n\}$ ($1 \le n \le L_p$) and the postamble is another sequence
that is orthogonal to the preamble sequence. The objective of this layout is to avoid the situation in which
the pilot sequence of one packet completely collides with the data field of another packet when two packets
are superposed at the relay node. This situation may cause channel estimation to fail. However, if the two
packets have similar sizes (see footnote 1), our design ensures that at least one pilot sequence, either the
preamble or the postamble, is almost free from interference from the data field of the other packet, as shown
in Figure 2.



Figure 1: Packet layout for our physical layer technique (preamble sequence, data field, postamble sequence).
The data field consists of the PLCP header added in the physical layer and the frame from the MAC layer.

Under a random access MAC protocol such as 802.11 or TREAN, the receiver cannot know when a packet
arrives. Hence the receiver has to keep sampling the channel to detect the arrival of packets. In our physical
layer, we require the receiver to keep sampling the signals in the channel at twice the symbol rate of the
transmission. Let $s[1], s[2], s[3], \ldots$ denote the samples obtained in the sampling procedure.
To detect the beginning of a packet, the receiver calculates the correlation between the pilot sequence and
these samples as

$$S[i] = \sum_{k=0}^{L_p-1} s[i + 2k]\, p[k + 1] \qquad (1)$$

When $S[i]$ and $S[i + 1]$ spike consecutively, we can conclude that a packet has arrived and that $s[i]$ is
its first sample. The end of a packet can be located in the same way. We also note that the method still
indicates the beginning of a packet when its preamble is corrupted by other packets. The reason is that the
correlation between the pilot sequence and an unrelated sequence is expected to be much smaller than the
correlation between the pilot sequence and itself, so components of other packets in the samples have little
impact on the correlation peak that indicates the arrival of the packet. Therefore the receiver is able to
detect the beginning of each packet in the superposed waveform with the correlation-based method.
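
The correlation test of Equation (1) can be sketched in a few lines of NumPy. This is only an illustrative sketch, not the authors' implementation: the function name `detect_packet_start`, the fixed `threshold` used to decide what counts as a "spike", and the complex conjugate applied to the pilot (which has no effect for real-valued pilots) are all assumptions.

```python
import numpy as np

def detect_packet_start(samples, pilot, threshold):
    """Return the 1-based index i where S[i] and S[i+1] both spike, or None.

    samples   : received samples taken at twice the symbol rate
    pilot     : known pilot symbols p[1..L_p]
    threshold : hypothetical spike level (an assumption of this sketch)
    """
    s = np.asarray(samples, dtype=complex)
    p = np.asarray(pilot, dtype=complex)
    span = 2 * (len(p) - 1) + 1          # samples covered by one correlation window
    n_corr = len(s) - span + 1
    if n_corr < 2:
        return None
    # S[i] = sum_k s[i + 2k] p[k + 1]: every second sample against the pilot
    corr = np.array([
        np.abs(np.sum(s[i:i + span:2] * np.conj(p))) for i in range(n_corr)
    ])
    for i in range(n_corr - 1):
        if corr[i] >= threshold and corr[i + 1] >= threshold:
            return i + 1                 # convert to the 1-based index of the text
    return None
```

With a 13-symbol Barker pilot repeated twice per symbol and placed at sample 21, the two consecutive peaks have height 13 while all other correlation values stay at or below 1, so any threshold between those levels locates the packet.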



Channel Estimation Once it detects the arrival of the superposed waveform broadcast by the relay node, the
receiver of the end node needs to estimate the channel gains experienced by both packets. If the channel is
estimated separately for each packet while treating the other packet as interference, the accuracy of the
estimates is poor because of the strong interferer. Therefore we jointly estimate the channel coefficients of
the two packets, which is similar to the idea in [22].

For the sake of clarity, we name the two packets packet F and packet S according to their transmission
order. Let $h_F$ and $h_S$ denote the fading coefficients experienced by packet F and packet S, let $c_F[n]$
($1 \le n \le L$) and $c_S[n]$ ($1 \le n \le L$) represent the symbol sequences of the two packets, let
$g_F(t)$ and $g_S(t)$ stand for the distorted pulse shapes after transmission, and let $T_d$ denote the
relative delay between the two packets. Also, $w(t)$ is the noise process. We should emphasize that $h_F$,
$h_S$, $g_F(t)$, $g_S(t)$ and $w(t)$ all account not only for the contribution of the uplink from the end node
to the relay node but also for that of the downlink from the relay node to the end node. Then the received
waveform $y(t)$ at the end node can be expressed as

$$y(t) = \sum_{n=1}^{L} h_F c_F[n]\, g_F\big(t - (n-1)T\big) + \sum_{n=1}^{L} h_S c_S[n]\, g_S\big(t - (n-1)T - T_d\big) + w(t) \qquad (2)$$

1 This can be easily guaranteed with the help of the upper layer protocol. For example, in the TREAN protocol,
two-way relaying cooperation only happens between frames of the same type and hence of similar sizes.
After the sampling process, the $i$th sample is given by

$$y[i] = \sum_{n=1}^{L} h_F c_F[n]\, g_F\Big((i-1)\tfrac{T}{2} + \Delta - (n-1)T\Big) + \sum_{n=1}^{L} h_S c_S[n]\, g_S\Big((i-1)\tfrac{T}{2} + \Delta - (n-1)T - T_d\Big) + w\Big((i-1)\tfrac{T}{2} + \Delta\Big) \qquad (3)$$

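where $\Delta$ is the time of the first sampling position of packet F and we have $0 \le \Delta \le T/2$.
Considering the odd-index samples, we have

$$y_{odd}[k] = y[2k-1] \qquad (4)$$

$$= \sum_{n=1}^{L} h_F c_F[n]\, g_F(kT + \Delta - nT) + \sum_{n=1}^{L} h_S c_S[n]\, g_S(kT + \Delta - nT - T_d) + w(kT - T + \Delta) \qquad (5)$$

$$= \sum_{n=1}^{L} h_F c_F[n]\, g_F(kT + \Delta - nT) + \sum_{n=1}^{L} h_S c_S[n]\, g_S(kT + \delta - nT - DT) + w(kT - T + \Delta) \qquad (6)$$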



where $D = \lceil (T_d - \Delta)/T \rceil$ and $\delta = DT - (T_d - \Delta)$, so that $\Delta - T_d = \delta - DT$
(without loss of generality, we can assume that $0 \le \delta < T/2$). In addition, let $[0, L_h T]$ denote
the region (see footnote 2) where the value of $g_v(t)$ ($v \in \{S, F\}$) is evidently above the level of the
noise strength. To simplify Equation (4), we can assume that

$$g_v(t) = 0$$

when $t < 0$ or $t > L_h T$. Hence, Equation (4) can be simplified as

$$y_{odd}[k] = \sum_{n=k-L_h+1}^{k} h_F c_F[n]\, g_F\big(\Delta + (k-n)T\big) + \sum_{n=k-D-L_h+1}^{k-D} h_S c_S[n]\, g_S\big(\delta + (k-D-n)T\big) + w(kT - T + \Delta)$$

$$= \sum_{n=0}^{L_h-1} h_F c_F[k-n]\, g_F(\Delta + nT) + \sum_{n=0}^{L_h-1} h_S c_S[k-n-D]\, g_S(\delta + nT) + w(kT - T + \Delta) \qquad (7)$$

The previous equation can be written in matrix form as



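$$y_{odd} = \begin{bmatrix} C_F & C_S \end{bmatrix} \begin{bmatrix} h_{F,odd} \\ h_{S,odd} \end{bmatrix} + w_{odd} \qquad (8)$$

where $h_{F,odd}$ and $h_{S,odd}$ are matrices with dimension $L_h \times 1$ and

$$h_{F,odd} = h_F\, g_{F,odd} = h_F \begin{bmatrix} g_F(\Delta) \\ g_F(\Delta + T) \\ \vdots \\ g_F(\Delta + (L_h - 1)T) \end{bmatrix} \qquad (9)$$

$$h_{S,odd} = h_S\, g_{S,odd} = h_S \begin{bmatrix} g_S(\delta) \\ g_S(\delta + T) \\ \vdots \\ g_S(\delta + (L_h - 1)T) \end{bmatrix} \qquad (10)$$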



2 The length of this region is close to the delay spread $\tau$ of the channel, which is about a few symbol
times in real systems, as indicated by [23].
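$C_F$ and $C_S$ are matrices with dimension $(L + D) \times L_h$ and

$$C_F = \begin{bmatrix}
c_F[1] & & & \\
c_F[2] & c_F[1] & & \\
\vdots & \vdots & \ddots & \\
c_F[L] & c_F[L-1] & \cdots & c_F[L-L_h+1] \\
 & c_F[L] & \cdots & c_F[L-L_h+2] \\
 & & \ddots & \vdots \\
 & & & c_F[L]
\end{bmatrix}, \qquad
C_S = \begin{bmatrix}
c_S[1] & & & \\
c_S[2] & c_S[1] & & \\
\vdots & \vdots & \ddots & \\
c_S[L] & c_S[L-1] & \cdots & c_S[L-L_h+1] \\
 & c_S[L] & \cdots & c_S[L-L_h+2] \\
 & & \ddots & \vdots \\
 & & & c_S[L]
\end{bmatrix}$$

where blank entries are zero. $w_{odd}$ is a matrix with dimension $(L + D) \times 1$ and

$$w_{odd} = \begin{bmatrix} w(\Delta) \\ w(T + \Delta) \\ \vdots \\ w((L + D - 1)T + \Delta) \end{bmatrix}$$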
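Similarly, the even-index samples $y_{even}$ can be denoted as

$$y_{even} = \begin{bmatrix} C_F & C_S \end{bmatrix} \begin{bmatrix} h_{F,even} \\ h_{S,even} \end{bmatrix} + w_{even}$$

where $h_{F,even}$, $h_{S,even}$ and $w_{even}$ are given by

$$h_{F,even} = h_F\, g_{F,even} = h_F \begin{bmatrix} g_F(\Delta + T/2) \\ g_F(\Delta + T/2 + T) \\ \vdots \\ g_F(\Delta + T/2 + (L_h - 1)T) \end{bmatrix}$$

$$h_{S,even} = h_S\, g_{S,even} = h_S \begin{bmatrix} g_S(\delta + T/2) \\ g_S(\delta + T/2 + T) \\ \vdots \\ g_S(\delta + T/2 + (L_h - 1)T) \end{bmatrix}$$

and

$$w_{even} = \begin{bmatrix} w(T/2 + \Delta) \\ w(T + T/2 + \Delta) \\ \vdots \\ w((L + D - 1/2)T + \Delta) \end{bmatrix}$$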



Figure 2: Superposed packets (packet F, packet S, and the superposed waveform with sample groups $y_1$, $y_3$
and $y_2$). We can observe that the preamble sequence of packet F does not collide with the data field of
packet S, while the postamble of packet S is free from the interference of the data field of packet F.



Consider the first $2L_p$ samples $y_1$ and the last $2L_p$ samples $y_2$, i.e. the samples that coincide with
the preamble of packet F and the postamble of packet S, as shown in Figure 2. Then the odd-index samples in
$y_1$ and $y_2$ can be expressed as

$$\begin{bmatrix} y_{1,odd} \\ y_{2,odd} \end{bmatrix} = \begin{bmatrix} C_{F,1} & C_{S,1} \\ C_{F,2} & C_{S,2} \end{bmatrix} \begin{bmatrix} h_{F,odd} \\ h_{S,odd} \end{bmatrix} + \begin{bmatrix} w_{1,odd} \\ w_{2,odd} \end{bmatrix} \qquad (18)$$

$$= C_{est} \begin{bmatrix} h_{F,odd} \\ h_{S,odd} \end{bmatrix} + \begin{bmatrix} w_{1,odd} \\ w_{2,odd} \end{bmatrix} \qquad (19)$$

where $C_{F,1}$ consists of the first $L_p$ rows of $C_F$ and $C_{F,2}$ consists of the last $L_p$ rows of
$C_F$. $C_{S,1}$, $C_{S,2}$, $w_{1,odd}$ and $w_{2,odd}$ are defined in a similar way. We note that the entries
in the first $L_p$ rows and the last $L_p$ rows of the matrices $C_F$ and $C_S$ only involve symbols in the
preamble or the postamble and hence are known by the receiver. Therefore the matrix $C_{est}$ is fully known
and we can use Equation (18) to estimate the channel coefficients $h_{F,odd}$ and $h_{S,odd}$. Based on
least-squares estimation, we have

$$\begin{bmatrix} \hat{h}_{F,odd} \\ \hat{h}_{S,odd} \end{bmatrix} = \big(C_{est}^{H} C_{est}\big)^{-1} C_{est}^{H} \begin{bmatrix} y_{1,odd} \\ y_{2,odd} \end{bmatrix} \qquad (20)$$

Similarly, it can be shown that

$$\begin{bmatrix} \hat{h}_{F,even} \\ \hat{h}_{S,even} \end{bmatrix} = \big(C_{est}^{H} C_{est}\big)^{-1} C_{est}^{H} \begin{bmatrix} y_{1,even} \\ y_{2,even} \end{bmatrix} \qquad (21)$$

We note that the inverse of $C_{est}^{H} C_{est}$ exists if and only if the matrix $C_{est}$ has full rank. If
the same set of pilot sequences, including the preamble and the postamble, is chosen by packet F and packet
S, the two packets must have a relative delay of at least $L_h$ symbol times to guarantee the full rank of
$C_{est}$. As mentioned previously, this time period is close to the delay spread of the channel and is on the
level of nanoseconds. Hence a relative delay shorter than this duration can easily be avoided by having the
MAC layer deliberately introduce a delay when necessary. Another solution is to use different sets of pilot
sequences for packet F and packet S. In this case, it can be shown that $C_{est}$ always has full rank, and
hence the MAC layer only needs to establish rules for choosing different sets of pilot sequences in different
situations. This solution adds less complexity to the MAC layer design and also avoids the overhead of the
deliberately introduced delay of the previous solution. However, it doubles the computational load of packet
detection, i.e. the receiver has to correlate the signal samples with four sequences (the preamble and
postamble sequences of packet F and of packet S) to detect the beginning and the end of packets. A better
solution is to choose the same set of pilot sequences for packet S and packet F but to use them in a different
order, i.e. the preamble of packet F is the postamble of packet S, while the postamble of packet F is the
preamble of packet S. In this way, the receiver only needs to correlate the signal samples with two sequences
(see footnote 3), and hence the computational load of packet detection remains unchanged. At the same time,
like the previous scheme, this solution guarantees the full rank of the matrix $C_{est}$ in all cases. Because
of these advantages, this solution is adopted in most scenarios.

3 Since it is known that the two packets overlap with each other, this operation does not cause confusion
between the end of packet F and the beginning of packet S.

At the end of this section, we emphasize that the channel coefficients $h_{v,u}$ ($v \in \{F, S\}$,
$u \in \{odd, even\}$) account not only for the fading terms $h_F$ (or $h_S$) but also for the pulse-shape
gain $g_F(\Delta + nT)$ (or $g_S(\delta + nT)$). In a general decoding procedure with timing synchronization,
the sampling offset $\Delta$ is fixed to the value corresponding to the optimal sampling point, and hence the
pulse-shape gain is constant. However, when two packets are superposed without any coordination, their
relative delay $T_d$ may change from transmission to transmission, and hence so do $\Delta$ and $\delta$.
Therefore we cannot assume that the channel coefficients are constant even in a slow fading channel, and we
have to estimate them for every transmission.
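
The joint least-squares step can be illustrated with a short NumPy sketch. It is a hedged example rather than the authors' code: the function name `joint_ls_estimate` is hypothetical, the pilot matrix is passed in already assembled, and `numpy.linalg.lstsq` is used in place of an explicit matrix inverse (for a full-rank matrix both give the same least-squares solution). The explicit rank check mirrors the full-rank condition on the pilot matrix discussed above.

```python
import numpy as np

def joint_ls_estimate(c_est, y_pilot):
    """Jointly estimate the stacked channel vector [h_F; h_S] by least squares.

    c_est   : known pilot matrix, shape (rows, 2 * L_h)
    y_pilot : stacked pilot-region samples [y_1; y_2] of one parity (odd or even)
    Returns (h_F_hat, h_S_hat), each of length L_h.
    """
    c_est = np.asarray(c_est, dtype=complex)
    y_pilot = np.asarray(y_pilot, dtype=complex)
    # The estimate is unique only when the pilot matrix has full column rank.
    if np.linalg.matrix_rank(c_est) < c_est.shape[1]:
        raise ValueError("C_est must have full column rank")
    h_hat, *_ = np.linalg.lstsq(c_est, y_pilot, rcond=None)
    half = c_est.shape[1] // 2
    return h_hat[:half], h_hat[half:]
```

If both packets used identical pilots with zero relative delay, the two column blocks of the pilot matrix would coincide and the function would raise, which corresponds to the rank-deficient case that the pilot-swapping rule is meant to avoid.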



Self-packet Identification The receiver also needs to determine which packet, packet F or packet S, was
transmitted by itself. This can be achieved by correlating the waveform samples with the symbol sequence
$c_k[n]$ of the known packet. Because 2x oversampling is adopted, every symbol is sampled twice. Hence
$y[2n - 1]$ and $y[2n]$ correspond to the same symbol in packet F. Therefore, if the symbol sequence of the
known packet is aligned with the position of packet F in the superposed waveform, the correlation can be
calculated as

$$Corr_{F,1} = \sum_{n=1}^{L} y[2n-1]\, c_k[n] \qquad (22)$$

and

$$Corr_{F,2} = \sum_{n=1}^{L} y[2n]\, c_k[n] \qquad (23)$$

Then we can use $R_F = \max\{Corr_{F,1}, Corr_{F,2}\}$ to indicate the relevance between packet F and the
known packet. Similarly, if $i_0$ denotes the first sample of packet S, then when the symbol sequence of the
known packet is aligned with the position of packet S in the superposed waveform, the correlation can be
calculated as

$$Corr_{S,1} = \sum_{n=1}^{L} y[(i_0 - 1) + 2n - 1]\, c_k[n] \qquad (24)$$

and

$$Corr_{S,2} = \sum_{n=1}^{L} y[(i_0 - 1) + 2n]\, c_k[n] \qquad (25)$$

Also

$$R_S = \max\{Corr_{S,1}, Corr_{S,2}\} \qquad (26)$$

If $R_F \ge R_S$, the relevance between packet F and the known packet is higher than that between packet S and
the known packet. Hence the receiver decides that packet F is the known packet, while packet S is the desired
packet. If $R_F < R_S$, the decision is reversed. Without loss of generality, we can assume that packet F is
already known by the receiver and packet S needs to be decoded.
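
The decision rule of Equations (22) to (26) can be sketched as follows. This is an illustrative sketch under stated assumptions: the function name `identify_own_packet` is hypothetical, the sample array is assumed to start at the first sample of packet F, and the correlation magnitudes (with a complex conjugate on the known symbols) are compared, which coincides with the equations for real, positive correlations.

```python
import numpy as np

def identify_own_packet(y, own_symbols, i0):
    """Return "F" if the known packet aligns with packet F, otherwise "S".

    y           : 2x-oversampled samples; y[0] is the first sample of packet F
    own_symbols : symbol sequence c_k[n] of the packet sent by this node
    i0          : 1-based index of the first sample of packet S
    """
    y = np.asarray(y, dtype=complex)
    c = np.conj(np.asarray(own_symbols, dtype=complex))
    n_sym = len(c)

    def corr(start):
        # Correlate every second sample, beginning at 0-based index `start`.
        seg = y[start:start + 2 * n_sym:2]
        return np.abs(np.sum(seg * c[:len(seg)]))

    r_f = max(corr(0), corr(1))            # Eqs. (22) and (23)
    r_s = max(corr(i0 - 1), corr(i0))      # Eqs. (24) and (25)
    return "F" if r_f >= r_s else "S"
```

For example, with an all-ones packet F, an alternating-sign packet S and a relative delay of four samples, the node that sent packet F obtains correlations of 8 versus 6 and decides "F", while the node that sent packet S obtains 6 versus 8 and decides "S".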

Waveform Recovery Based on the previous knowledge, we remove the components of the known packet from the
waveform samples to enable the decoding of the desired packet. The processed samples are denoted as
$y_{S,odd}$ (odd-index samples) and $y_{S,even}$ (even-index samples), and they are given by

$$y_{S,odd} = y_{odd} - C_F \hat{h}_{F,odd} \qquad (27)$$
$$= C_S h_{S,odd} + C_F\big(h_{F,odd} - \hat{h}_{F,odd}\big) + w_{odd} \qquad (28)$$
$$\approx C_S h_{S,odd} + w_{odd} \qquad (29)$$

and

$$y_{S,even} = y_{even} - C_F \hat{h}_{F,even} \qquad (30)$$
$$= C_S h_{S,even} + C_F\big(h_{F,even} - \hat{h}_{F,even}\big) + w_{even} \qquad (31)$$
$$\approx C_S h_{S,even} + w_{even} \qquad (32)$$
At this point, we could decode packet S based on Equation (27) or Equation (30). However, due to the
uncertainty of $\Delta$ and $\delta$, the pulse-shape gain $g_F$ (or $g_S$) may not be optimal. This degrades
the signal-to-noise ratio, which is proportional to $|h_{S,u}|^2 / E[w_u^2] = |h_S g_{S,u}|^2 / E[w_u^2]$
($u \in \{odd, even\}$), and hence has a negative impact on decoding performance. A better solution is to
recover the waveform of packet S by taking advantage of the redundant samples and then to relocate the optimal
sampling points. For convenience, we merge $y_{S,odd}$ and $y_{S,even}$ as

$$y_S[k] = \begin{cases} y_{S,odd}[(k+1)/2] & k \text{ is odd} \\ y_{S,even}[k/2] & k \text{ is even} \end{cases} \qquad (33)$$

Then the waveform of packet S can be recovered as

$$y_S(t) = \sum_{n \ge 1} y_S[n]\, \mathrm{sinc}\!\left(\frac{t - nT/2}{T/2}\right) \qquad (34)$$

In practice, the calculation of $y_S(t_0)$ is approximated by a summation over a few terms whose indices
are close to $2t_0/T$.
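
The merging and interpolation steps can be sketched as below. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names and the `half_window` parameter (how many neighbouring samples are kept in the truncated sum) are hypothetical, sample $n$ is assumed to sit at time $nT/2$, and `numpy.sinc` is the normalized sinc $\sin(\pi x)/(\pi x)$.

```python
import numpy as np

def merge_streams(y_odd, y_even):
    """Interleave equal-length odd- and even-index sample streams (Eq. (33))."""
    y_odd = np.asarray(y_odd, dtype=complex)
    y_even = np.asarray(y_even, dtype=complex)
    merged = np.empty(len(y_odd) + len(y_even), dtype=complex)
    merged[0::2] = y_odd     # samples 1, 3, 5, ... in the 1-based notation of the text
    merged[1::2] = y_even    # samples 2, 4, 6, ...
    return merged

def sinc_interpolate(y, t, symbol_time, half_window=8):
    """Approximate y_S(t) with a truncated sinc sum over nearby samples (Eq. (34))."""
    y = np.asarray(y, dtype=complex)
    ts = symbol_time / 2.0                      # sample spacing T/2
    center = int(round(t / ts))                 # index closest to 2t/T
    lo = max(1, center - half_window)
    hi = min(len(y), center + half_window)
    n = np.arange(lo, hi + 1)                   # 1-based sample indices
    return np.sum(y[n - 1] * np.sinc((t - n * ts) / ts))
```

Evaluating the interpolator exactly at a sample instant returns that sample, because the sinc kernel vanishes at all other sample positions; evaluating it between samples gives the value used to relocate the sampling point.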

Decoding The recovered waveform can be decoded with a general decoding procedure. Hence we can consider
all of the previous steps as a preprocessing unit, placed before the general decoding block, which extracts
the waveform of the desired packet from the superposed waveform. This preprocessing unit is independent of
the decoding block that follows it. This feature makes our algorithm applicable to a wide variety of
modulation schemes.

3 Two-way Relaying Enhanced Ad-hoc Network Protocol 

In this section, we propose a new protocol called TREAN (Two-way Relaying Enhanced Ad-hoc Network) that
incorporates our new physical layer scheme into general ad hoc networks to boost performance. The protocol
is by nature a random access MAC scheme and borrows some essentials from the CSMA/CA protocol. It performs
RTS/CTS-like queries to build the cooperative configuration for two-way relaying. After that, the new
physical layer scheme is applied to conduct the two-way relay transmission with the least coordination. In
the rest of this section, we first present the basic mode of the TREAN protocol to describe in detail how the
protocol works. Then two extended modes are discussed, which further enhance the performance of the TREAN
protocol.

3.1 The Basic Mode 

The two-way relaying cooperation process in the TREAN protocol can be divided into three sub-processes, as
shown in Figure 3. The handshaking process sets up the connections between stations for the cooperation
and clears the channel for data transmissions. Following that, the two-way relay process adopts our new
physical layer technique to transmit data packets. Finally, the ACK process reports successful transmissions,
also with the help of our two-way relaying scheme. The detailed procedures are discussed as follows.

Figure 3: The two-way relaying cooperation process in TREAN protocol
Figure 4: Schematic diagram illustrating the basic mode of TREAN. Station A has a packet with stations B and
C as the next two hops on its routing path and initiates the transmission by sending the RTS frame. Station C
is called the next two-hop destination of the packet. The next two-hop destination is not necessarily the
final destination of the packet.



Handshaking process Consider a station A with a data packet to transmit, as shown in Figure 4. As in the
CSMA/CA protocol, the station first sends the RTS frame when the channel is sensed idle and the backoff
time counter has decreased to zero. The contents of the RTS frame include those of its CSMA/CA counterpart.
In addition, for the TREAN protocol the address of the next two-hop destination on the routing path of the
data packet is added to the RTS frame. The format of the modified RTS frame is shown in Figure 5. If source
routing is used, the address of the next two-hop destination can be extracted from the data packet directly.
If hop-by-hop routing is used, the routing table at every station should indicate the next two hops towards
each possible destination in order to provide this information. This can be achieved by periodically
exchanging routing information between neighbor stations. A special case arises when the data packet reaches
its final destination after the next hop, i.e. the next two-hop destination does not exist for the packet. In
this scenario, a special RTS frame is sent and the transmission procedure is the same as in the CSMA/CA
protocol.
Figure 5: The format of the RTS frame in TREAN. The Frame Control field and the FCS field are the same as
those specified in the 802.11 Standard [24]. RA and TA are the addresses of the receiver and the transmitter
of the RTS frame. NA is the address of the next two-hop destination of the data packet being transmitted.
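
As an illustration of the frame format, the sketch below packs a 26-byte RTS frame with an NA field. Several details are assumptions, not taken from the figure: the field order (Frame Control, Duration, RA, TA, NA, FCS) and the 2-byte Duration field follow the standard 802.11 RTS layout, which together with three 6-byte addresses and a 4-byte FCS matches the 26-byte size listed in Table 1; the FCS is computed with `zlib.crc32` as a stand-in; and the function names are hypothetical.

```python
import struct
import zlib

def frame_control(subtype, frame_type=0b01, version=0):
    """Build the two Frame Control octets; the flags octet is left at zero (assumption)."""
    # b1..b0 = protocol version, b3..b2 = type, b7..b4 = subtype
    first_octet = version | (frame_type << 2) | (subtype << 4)
    return bytes([first_octet, 0])

def build_trean_rts(duration_us, ra, ta, na):
    """Pack an RTS frame carrying the next two-hop destination address NA."""
    for addr in (ra, ta, na):
        if len(addr) != 6:
            raise ValueError("addresses must be 6 bytes long")
    # Subtype 0000: RTS with the next two-hop destination (Table 1).
    body = frame_control(0b0000) + struct.pack("<H", duration_us) + ra + ta + na
    fcs = struct.pack("<I", zlib.crc32(body) & 0xFFFFFFFF)
    return body + fcs
```

A receiver could recover NA from bytes 16 to 21 of such a frame; station B would then place the address of the RTS sender into the NA field of its RTC frame.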

We also modify the blocking function of the RTS frame (see footnote 4). In the TREAN protocol, the RTS frame
only blocks neighbor stations in the period from the end of its own transmission to the time when the data
frame should be transmitted. If the handshaking process is free from interference and hence the data frame
is transmitted, neighbor stations remain inactive until the estimated end time of the cooperation, according
to the NAV information carried by the data frame. This modification is motivated by two facts. On the one
hand, because the whole transmission process under the TREAN protocol contains a two-way relaying
cooperation, as shown in Figure 3, the NAV duration (without modification) in the RTS frame of the TREAN
protocol would be about twice as long as that in the CSMA/CA protocol. On the other hand, the two-way
relaying cooperation in the TREAN protocol involves three stations, and this relatively complicated
configuration is more vulnerable to the hidden node problem, which may cause the transmission to fail.
It is therefore possible that the cooperation initiated by an RTS frame fails while all the neighbors of the
RTS transmitter remain blocked for a long period, which has a negative impact on performance. With the
modification, the blocking of neighbor stations is removed after a short time period if the cooperation
initiated by the RTS frame fails.

In addition, after the transmission of the RTS frame, an RTS-frame timer is set up. If no valid CTS frame
is received before the timeout, the station increases its backoff stage and takes a random backoff.

4 The blocking functions of the RTC frame and the ATC frame are modified in a similar way.
If the RTS frame is successfully delivered to its receiver, the receiver waits for a SIFS period and then
transmits an RTC (Request to Cooperate) frame to the station whose address is written in the NA field of the
received RTS frame. For example, as shown in Figure 4, if station B correctly receives the RTS frame from
station A, it sends an RTC frame to station C. This frame can also be overheard by station A.
Figure 6: The format of the RTC frame in TREAN protocol. The Frame Control field and the FCS field are the
same as those specified in the 802.11 Standard [23]. RA and TA are the addresses of the receiver and the
transmitter of the RTC frame. NA is the address of the transmitter of the RTS frame.

The format of the RTC frame is shown in Figure 6. The information carried by this frame includes not
only the addresses of the transmitter and the receiver of this frame, but also that of the sender of the
previously received RTS frame. The operation mode of the TREAN protocol is also specified by this frame.
This is achieved by using different values in the subtype sub-field of the frame control field, as shown in
Table 1.



Type value   Type          Subtype value   Subtype description                        Size
(b3 b2)      description   (b7 b6 b5 b4)
----------   -----------   -------------   ----------------------------------------   --------
01           Control       0000            RTS with the next two-hop destination      26 bytes
01           Control       0001            RTS without the next two-hop destination   20 bytes
01           Control       0010            RTC for basic mode                         26 bytes
01           Control       0011            RTC for extended mode 1                    39 bytes
01           Control       0100            RTC for extended mode plus                 39 bytes
01           Control       0101            ATC                                        26 bytes
01           Control       0110            CTS for basic mode and extended mode       20 bytes
01           Control       0111            CTS for extended mode plus                 26 bytes
01           Control       1000            CTS for one-way relay                      20 bytes

Table 1: The subtype field for control frames in TREAN protocol. All of these combinations of the type field
and the subtype field are reserved and not used in the 802.11 standard [24].
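
The subtype assignments of Table 1 can be turned into a small lookup, sketched below. The dictionary contents are copied from the table; the function name `classify_control_frame` and the choice to return `None` for non-control frames or unlisted subtypes are assumptions made for this sketch. The bit positions follow the table header (type in b3 b2, subtype in b7 to b4 of the first Frame Control octet).

```python
# Subtype value -> (description, frame size in bytes), copied from Table 1.
TREAN_SUBTYPES = {
    0b0000: ("RTS with the next two-hop destination", 26),
    0b0001: ("RTS without the next two-hop destination", 20),
    0b0010: ("RTC for basic mode", 26),
    0b0011: ("RTC for extended mode 1", 39),
    0b0100: ("RTC for extended mode plus", 39),
    0b0101: ("ATC", 26),
    0b0110: ("CTS for basic mode and extended mode", 20),
    0b0111: ("CTS for extended mode plus", 26),
    0b1000: ("CTS for one-way relay", 20),
}

def classify_control_frame(first_octet):
    """Map the first Frame Control octet to a Table 1 entry, or None if not listed."""
    frame_type = (first_octet >> 2) & 0b11      # bits b3 b2
    subtype = (first_octet >> 4) & 0b1111       # bits b7 b6 b5 b4
    if frame_type != 0b01:                      # only control frames are listed
        return None
    return TREAN_SUBTYPES.get(subtype)
```

For instance, an octet of 0x24 (type 01, subtype 0010) maps to the RTC frame for the basic mode with an expected size of 26 bytes.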

If the RTC frame is correctly received by its receiver, station C, the address written in the NA field of
the RTC frame is extracted first. Station C then checks its transmission buffer to determine whether it has
a data packet towards the station indicated by this address (station A in our example). If such a packet
exists, station C sends back an ATC (Answer to Cooperate) frame to the transmitter of the RTC frame, namely
station B. The format of the ATC frame is shown in Figure 7.
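
The decision made by station C can be sketched as follows. This is a simplified illustration under stated assumptions: the class and function names are hypothetical, the transmission buffer is modelled as a plain list holding the two-hop destination address of each queued packet, and the case in which no matching packet exists is not handled here (the sketch simply returns `None`).

```python
from dataclasses import dataclass
from typing import List, Optional

@dataclass
class RtcFrame:
    ra: str   # receiver address (station C)
    ta: str   # transmitter address (relay, station B)
    na: str   # address of the RTS sender (station A)

@dataclass
class AtcFrame:
    ra: str   # receiver address (relay, station B)
    ta: str   # transmitter address (station C)
    na: str   # copied from the NA field of the RTC frame

def handle_rtc(own_addr: str, rtc: RtcFrame, tx_buffer: List[str]) -> Optional[AtcFrame]:
    """Answer an RTC frame with an ATC frame if a packet towards rtc.na is queued."""
    if rtc.ra != own_addr:
        return None                      # the frame is addressed to another station
    if rtc.na not in tx_buffer:
        return None                      # no reverse-direction packet: outside this sketch
    return AtcFrame(ra=rtc.ta, ta=own_addr, na=rtc.na)
```

In the example of the text, station C holding a packet destined for A would answer B's RTC frame with an ATC frame addressed to B whose NA field still names A.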

Also, the RTC frame can be overheard by station A. If station A takes no action after receiving this 
frame, neither station A nor station B transmits until the CTS frame is broadcast, as shown 
in Figure 8. During this silent period, the channel around station A may be recaptured by other nodes because 
no transmission is taking place nearby. The situation is worse when the ATC frame is transmitted at a relatively low 
rate, which lengthens the vulnerable period. Also, if the station density is high, recapture of 
the channel tends to happen more frequently. 

The blocking function of the RTS frame in the CSMA/CA protocol solves this issue only partially. 
Stations outside the communication range but inside the interference range of station A are not affected by the RTS 
frame; they can recapture the channel and cause the cooperation initiated by station A to fail. 

Figure 7: The format of the ATC frame in TREAN. The Frame Control field and FCS field are the same as 
those specified in the 802.11 standard [23]. RA and TA are the addresses of the receiver and the transmitter of the 
ATC frame. NA is the address copied from the NA field of the RTC frame. 

Figure 8: The vulnerable period for station A. 



A better solution to this problem is to use a CPP (Channel Protection Packet): station A sends 
a packet after receiving the RTC frame to protect its channel, as shown in Figure [3]. Although this resolves the previous 
issue, it may cause the ATC frame and the CPP frame to superpose at station B. However, if the 
CPP frame is known to station B, our physical layer technique can be applied to cancel the CPP 
frame and extract the ATC frame. 

We can choose the RTS frame previously transmitted by station A as the CPP frame. We also require 
station C to change the usual order of the pilot sequences in its ATC frame, namely to swap the preamble 
and the postamble, so that channel estimation in our physical layer scheme still works. With 
these two measures, station B can retrieve the buffered RTS frame and use it to extract the ATC frame with our physical 
layer decoding algorithm once the collision happens. 

If station B receives the ATC frame before the RTCframe-timer expires, it broadcasts a CTS 
frame to station A and station C. This frame serves as an authorization for the two-way relay cooperation 
between station A and station C and at the same time blocks the transmissions of all other neighbors of 
station B. However, if no ATC frame is received before the timeout, station B presumes that station C does 
not have a data packet for station A. In this case, station B broadcasts a special CTS indicating 
that one-way relaying is performed in the following stages. The format of the CTS frame is shown in Figure 9. 

Figure 9: The format of the CTS frame in TREAN. The Frame Control field and FCS field are the same as 
those specified in the 802.11 standard [24]. RA and TA are the addresses of the transmitters of the RTS frame and 
the ATC frame respectively. Also, an EA field is added in extended mode plus to indicate the destination 
of the backward data frame. 
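
The two decisions described above (station C deciding whether to answer, and station B choosing which CTS to broadcast) can be summarized in a short sketch. It is only an illustration in basic mode: the function names and the representation of the timer as plain timestamps are assumptions, while the subtype values are those listed in Table 1.

```python
CTS_TWO_WAY = 0b0110  # "CTS for basic mode and extended mode" in Table 1
CTS_ONE_WAY = 0b1000  # "CTS for one-way relay" in Table 1


def station_c_sends_atc(buffered_destinations, na_address):
    """Station C answers only if it buffers a packet destined for the NA address."""
    return na_address in buffered_destinations


def station_b_cts_subtype(atc_arrival_time, rtc_timer_expiry):
    """Choose the CTS subtype; atc_arrival_time is None when no ATC arrived."""
    if atc_arrival_time is not None and atc_arrival_time < rtc_timer_expiry:
        return CTS_TWO_WAY   # authorize the two-way relay cooperation
    return CTS_ONE_WAY       # fall back to one-way relaying


# Station C holds a packet for A, and its ATC reaches B before the timer expires.
answer = station_c_sends_atc({"A", "D"}, "A")
subtype = station_b_cts_subtype(atc_arrival_time=120, rtc_timer_expiry=200)
```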



Two-way relay process. If the CTS frame for two-way relay is received by station A and station C, they 
transmit their data packets to station B after a SIFS time period. As with the ATC frame, 
station C swaps the preamble and the postamble of its data frame. Station B then amplifies and forwards 
the received waveform to the transmitters of the two data packets. Once they receive the waveform, station A and 
station C can decode their desired data packets with our physical layer decoding algorithm. However, if the 
CTS frame for one-way relay is broadcast, only station A transmits its data packet to station B. Once 
it receives the packet, station B forwards it to station C after waiting for SIFS. 

ACK process. If station A and station C decode their data packets correctly, they send ACK frames to 
announce the successful reception. These ACK frames are also exchanged in the two-way relaying manner, as shown 
in Figure [3]. 

Because the structural unit of our physical layer involves three cooperating nodes, it is more complicated 
and larger than that of a traditional physical layer. Hence our scheme suffers from a more serious hidden node 
problem than traditional ones. One effect of this issue is an increased loss rate of ACK frames. 
Our solution is to acknowledge previously received data frames from the same sender in the current 
ACK frame. This is achieved by including the frame IDs of recently received data frames in the ACK frame 
to signal the successful transmission of these frames. The format of the ACK frame in the TREAN protocol 
is shown in Figure 10. Under this ACK scheme, a station buffers the transmitted data frame instead of 
retransmitting it immediately if the corresponding ACK frame is not received. When the next ACK frame 
from the transmitter of the lost ACK arrives, the station can determine from the ID fields of that ACK frame 
whether the previous data frame was received, and then either retransmit the frame or delete it from the 
buffer. 

Figure 10: The format of the ACK frame in TREAN. The Frame Control field, Duration field and FCS field are the 
same as those specified in the 802.11 standard [24]. RA is the address of the transmitter of the data. ID1, ID2 
and ID3 are the frame IDs of the most recently received data frames from RA. 
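
The ACK scheme above can be sketched as follows. This is an illustrative model only, assuming three ID fields as in the ACK frame format and integer frame IDs; the class names `AckBuilder` and `SenderBuffer` are hypothetical and do not appear in the protocol description.

```python
from collections import deque


class AckBuilder:
    """Receiver side: remembers the IDs of the most recently received data frames."""

    def __init__(self, id_fields=3):  # three ID fields (ID1, ID2, ID3) are assumed
        self.recent_ids = deque(maxlen=id_fields)

    def on_data_received(self, frame_id):
        """Record a decoded data frame and return the ID fields for the ACK."""
        self.recent_ids.append(frame_id)
        return tuple(self.recent_ids)


class SenderBuffer:
    """Sender side: keeps unacknowledged frames instead of retransmitting at once."""

    def __init__(self):
        self.pending = {}  # frame_id -> payload

    def on_data_sent(self, frame_id, payload):
        self.pending[frame_id] = payload

    def on_ack(self, acked_ids):
        """Drop frames confirmed by the ACK; return the IDs still to retransmit."""
        for frame_id in acked_ids:
            self.pending.pop(frame_id, None)
        return sorted(self.pending)


receiver, sender = AckBuilder(), SenderBuffer()
sender.on_data_sent(1, b"first")
receiver.on_data_received(1)             # suppose the ACK for frame 1 is lost
sender.on_data_sent(2, b"second")
ack_ids = receiver.on_data_received(2)   # this ACK carries the IDs of frames 1 and 2
to_retransmit = sender.on_ack(ack_ids)   # frame 1 is confirmed late, nothing to resend
```

In this toy run the lost ACK costs no retransmission, which is the effect the scheme aims for.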

3.2 Extended Modes 

Extended Mode. The basic mode of the TREAN protocol yields a substantial improvement in the throughput 
of the network when the answer-to-cooperation (ATC) probability is high. The ATC probability is here 
defined as the probability that the station has data frames to send back towards the address in the NA field of 
the RTC frame. However, the performance degrades when the ATC probability goes down. Unfortunately, in 
many realistic systems, the ATC probability is low. To further improve the performance of the TREAN protocol 
even in low-ATC-probability scenarios, the extended mode of the TREAN protocol is needed. 

To run the extended mode of the TREAN protocol, every station in the network should maintain a special 
neighbor table, called the close neighbor table. The table contains the neighbors that are close to the station in 
distance and hence have high-quality communication links with the station. The threshold for recognizing 
a close neighbor depends on the noise level, the station density of the network, etc. In addition, stations 
should periodically exchange their close neighbor tables with all their neighbors. 

The basic idea behind the extended mode of the TREAN protocol is to offer the cooperation opportunity 
to more stations. As shown in Figure 11, we assume that station C does not have a data frame for 
station A but station E happens to have one. In the basic mode, the two-way relay cooperation cannot 
be formed because station C has no backward data frame. However, if the cooperation opportunity can 
be offered to station E, a special two-way relay cooperation can be formed that transmits the frame of station 
A to station C and at the same time sends that of station E to station A with the help of station B. 

The cooperation opportunity is offered to more stations by station B with the help of the 
extended RTC frame, as shown in Figure 6. After receiving the RTS frame from station A, station B 
randomly chooses some stations from the close neighbor table of station C (indicated by the NA field of the 
RTS frame) and writes the addresses of these stations in the EA fields of the extended RTC frame, indicating 
that these stations are also allowed to participate in the cooperation besides station C. For convenience, 
we call all of these stations (including station C) qualified stations. Evidently, the maximum possible 
number of qualified stations is equal to the number of EA fields $n_{EA}$ in an extended RTC frame plus one. 
Hence station B can randomly allocate a sequence number in the range from 0 to $n_{EA}$ to every qualified 
station and guarantee that no two stations share the same sequence number. These allocated sequence numbers 
are written in the TS field of the extended RTC frame in the specified order and reflect the priority of 
the qualified stations in the cooperation. Specifically, a qualified station with sequence number $i$ 
has to wait for a period equal to (SIFS + i × TimeSlot) before transmitting its ATC frame. However, to 
avoid interruption by irrelevant stations, the waiting time of qualified stations should not be longer than 
DIFS. This limits the maximum sequence number $n_{EA}$, i.e. the maximum number of EA 
fields in an extended RTC frame should not be greater than (DIFS - SIFS)/TimeSlot. If DIFS and SIFS are 
specified as in the 802.11 standard [24], the number of EA fields should not be more than two. However, 
according to the simulation results, two EA fields are enough to enhance the throughput considerably in 
low-ATC-probability scenarios. 

Figure 11: The schematic diagram illustrating the extended mode of the TREAN protocol. Station A 
has a packet with stations B and C as the next two hops on its routing path and initiates the transmission by 
sending the RTS frame. Station E is a close neighbor of station C, and both of them may have data frames 
to send back to station A. 
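
The bound on the number of EA fields follows directly from the waiting rule. The sketch below is a hedged illustration: the function names are hypothetical, and the timing values used in the example (SIFS = 10 µs, slot = 20 µs, DIFS = 50 µs, and SIFS = 16 µs, slot = 9 µs, DIFS = 34 µs) are assumed typical 802.11 DSSS and OFDM parameters rather than values stated in this paper.

```python
def max_ea_fields(sifs_us, difs_us, slot_us):
    """Largest sequence number whose waiting time does not exceed DIFS."""
    return (difs_us - sifs_us) // slot_us


def atc_wait_us(sequence_number, sifs_us, slot_us):
    """Waiting time before a qualified station may transmit its ATC frame."""
    return sifs_us + sequence_number * slot_us


# Assumed DSSS-style timing: DIFS = SIFS + 2 * slot.
n_ea_dsss = max_ea_fields(sifs_us=10, difs_us=50, slot_us=20)
# Assumed OFDM-style timing.
n_ea_ofdm = max_ea_fields(sifs_us=16, difs_us=34, slot_us=9)
waits = [atc_wait_us(i, 10, 20) for i in range(n_ea_dsss + 1)]
```

Because DIFS equals SIFS plus two slot times in these parameter sets, the bound evaluates to two in both cases, which matches the limit stated above.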

On receiving the extended RTC frame, each qualified station sends its ATC frame after the required 
waiting time if it has data frames to send back to station A. Before transmitting its ATC 
frame, every qualified station should keep sensing the channel. If a strong signal is sensed, the qualified station 
knows that at least one nearby station is transmitting. However, all nearby stations 
except the qualified ones have to wait at least DIFS after the transmission of the RTC frame. Therefore the 
qualified station can conclude that another qualified station with a smaller sequence number has transmitted an 
ATC frame to station B, and hence it cancels its own ATC frame transmission. In this way, exactly 
one ATC frame is sent to station B unless no qualified station has a backward data frame 
for station A. This avoids the collision of ATC frames from different qualified stations. 
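
The outcome of this contention rule can be expressed compactly. The following sketch assumes ideal carrier sensing among the qualified stations (every contender hears the first ATC); the function name `atc_transmitter` and the station labels are hypothetical.

```python
def atc_transmitter(sequence_numbers, has_backward_frame):
    """Return the qualified station that ends up sending the ATC frame, or None.

    sequence_numbers: dict mapping each qualified station to its unique sequence number.
    has_backward_frame: set of stations holding a data frame for the RTS transmitter.
    """
    contenders = [s for s in sequence_numbers if s in has_backward_frame]
    if not contenders:
        return None  # no ATC is sent, so station B falls back after its timer expires
    # The contender with the smallest sequence number transmits first, after
    # SIFS + i * TimeSlot; the others sense its signal and cancel their own ATC.
    return min(contenders, key=sequence_numbers.get)


winner = atc_transmitter({"C": 1, "E": 0, "G": 2}, {"C", "G"})
```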

At the same time, the transmitter of the RTS frame, i.e. station A, overhears the transmission of the 
extended RTC frame. As in the basic mode of the TREAN protocol, once the RTC frame is received, station A 
transmits a CPP frame after a SIFS time period. In the extended mode, however, the transmission of 
the ATC frame may not start until (SIFS + i × TimeSlot) after the reception of the extended RTC 
frame, as discussed previously. Hence the CPP frame and the ATC frame may superpose at station B with 
a relative delay as large as a few slot times. In this case, OFDM-based or LCC-based physical layer techniques 
for two-way relaying cannot work because the asynchrony exceeds what they can tolerate. Fortunately, our 
physical layer scheme can handle this case and help station B extract the ATC frame from the superposed 
waveform. 

Once an ATC frame from a qualified station is received before the RTCframe-timer expires, station 
B broadcasts a CTS frame containing the addresses of station A and of the transmitter of the received ATC 
frame. This indicates that these two stations can transmit their data frames in the two-way relay process and 
that all others should refrain from sending any information. 

Without loss of generality, we can assume that station E successfully sends an ATC frame. On receiving 
the CTS frame, station E and station A transmit their data frames to station B after a time period equal 
to SIFS plus a tiny random delay. At the same time, station C should overhear the transmission of 
station E, although the frame is not addressed to it. Because station A transmits concurrently 
with station E, the overhearing suffers interference from station A. Nevertheless, this interference 
is negligible for two reasons. First, the direct link between station A and station C is weak; otherwise 
it would not be necessary to include station B in the routing path between station A and station C. Second, the signal 
from station E to station C is strong, because station E, as a qualified station, 
is chosen from the close neighbors of station C. Therefore the signal from station A is easily dominated 
by that from station E and has little impact on the decoding of the overheard frame. If station C obtains 
the data frame of station E, it can apply our physical layer technique to extract the data frame of station A from 
the superposed waveform received in the broadcast stage of the two-way relay process. Also, station A can extract 
its desired frame with our physical layer decoding algorithm. In this way, the two data frames are delivered to 
their respective destinations. The two stations then relay ACK frames as in the basic mode of the TREAN protocol. 

Extended Mode plus. If further improvement of the performance of the TREAN protocol in low-ATC-probability 
scenarios is needed, we can increase the number of EA fields in an extended RTC frame to 
offer the cooperation opportunity to more stations and hence raise the probability that a two-way relay 
cooperation is formed successfully. However, the number of EA fields is usually limited by the 
parameters SIFS, DIFS and the time slot, as mentioned previously. Another solution is to run the extended 
mode plus of the TREAN protocol. The basic idea behind the extended mode plus is to relax the restriction on 
backward data frames: qualified stations can reply with ATC frames not only when they have data frames 
for station A but also when they have data frames for a close neighbor of station A. This considerably increases the 
probability that an ATC frame is sent, and hence the probability that a two-way 
relay cooperation is formed. However, this comes at the cost of management overhead, i.e. every station has to know 
the close neighbor tables of its two-hop neighbors in addition to those of its direct neighbors. 

The operation of the extended mode plus can be further explained with the help of the example shown 
in Figure 12. The qualified station E has a data frame for station F, a close neighbor of station A. If no 
qualified station with a smaller sequence number has data frames for station A or its close neighbors, station 
E transmits an ATC frame after waiting the required time period. Once authorized by the CTS frame, 
station E and station A send their data frames to station B. Meanwhile, station C and station 
F overhear the transmissions from their respective close neighbors. After that, station B amplifies 
and forwards the received superposed waveform to station C and station F, which then use their 
overheard frames to extract their desired frames from the waveform with our physical layer scheme. 



Figure 12: The schematic diagram illustrating the extended mode plus of the TREAN protocol. Station 
A has a packet with stations B and C as the next two hops on its routing path and initiates the transmission 
by sending the RTS frame. Station E is a close neighbor of station C, and station F is a close neighbor of 
station A. 
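
The difference between the extended mode and the extended mode plus lies only in when a qualified station may answer. A minimal sketch of that eligibility test is given below; the function name, argument names and the set-based representation of buffers and close neighbor tables are assumptions made for illustration.

```python
def may_send_atc(buffered_destinations, initiator, initiator_close_neighbors, extended_plus):
    """Decide whether a qualified station may reply with an ATC frame.

    buffered_destinations: destinations of the data frames buffered at the station.
    initiator: the RTS transmitter (station A in the example).
    initiator_close_neighbors: close neighbor table of the initiator.
    extended_plus: True when the network runs the extended mode plus.
    """
    if initiator in buffered_destinations:
        return True  # a backward frame for the initiator qualifies in every mode
    if extended_plus:
        # A frame for a close neighbor of the initiator also qualifies.
        return any(d in initiator_close_neighbors for d in buffered_destinations)
    return False


# Station E buffers a frame for F, and F is a close neighbor of A (as in Figure 12).
plus_mode = may_send_atc({"F"}, "A", {"F"}, extended_plus=True)
plain_mode = may_send_atc({"F"}, "A", {"F"}, extended_plus=False)
```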

At the end of this section, we emphasize that although the extended modes provide better performance in 
low-ATC-probability scenarios, they also result in larger management overhead. The extended mode 
requires stations to maintain close neighbor tables and exchange them with all neighbor stations. In extended 
mode plus, stations must additionally know the close neighbor tables of their two-hop neighbors. 
None of these is a prerequisite in the basic mode. Therefore we should select a suitable mode according to 
the application scenario. In a mobile network or a network with high bi-directional data flows, the 
basic mode is the better solution, while in a stationary network with low ATC probability, the extended modes are 
needed to enhance the performance. Also, adaptive switching between different modes is feasible. 

4 Performance Analysis 



In this section, we adopt a Markov model to derive the saturation throughput of a network running the TREAN 
protocol. The objective is to evaluate the performance of the TREAN protocol theoretically. Also, the theoretical 
results can provide insights and guidelines for selecting optimal parameters in protocol design. 

As prior works US], we define the throughput as the successfully transmitted payload in the network 
within a unit time period. However, in this paper we redefine the saturation condition: every station always 
has packets to transmit, and each station has packets to send back when a cooperation request is received, 
namely the transmission queue for any potential two-hop destination is always non-empty. Also, to show 
the full capacity of the TREAN protocol, we assume that all transmissions are performed in the two-way relaying manner, 
i.e. the special RTS for a data frame without a next two-hop destination is never initiated. 

The derivation is divided into two subsections according to the size of the network in which the TREAN protocol 
is applied. In subsection 4.1, we derive the saturation throughput for a small-scale network. Stations in this 
type of network can sense transmissions from all other stations, namely the network is free from the hidden 
node problem. In subsection 4.2, we provide a derivation of the saturation throughput of a network deployed 
over a large area. In this scenario, the size of the network exceeds the sensing range of a station, and hence 
the performance of the network is affected by hidden stations. 



4.1 Saturation Throughput in small-scale network 



Consider $n$ stations in a small-scale network. Each station may be in a different backoff stage and 
hold a different value in its backoff-time counter. Let $S_{i,j}$ denote the state in which the station is in the 
$i$th backoff stage and its backoff counter equals $j$. Note that the backoff stage is upper 
bounded by a constant $m$, and the value of the backoff counter in stage $i$ ranges from zero to the contention 
window $W_i$ minus one. 

Two important probabilities affect the transitions between the states $\{S_{i,j}\}$. One of them 
is the transmission failure probability $p_f$, defined as the probability that a collision happens in the transmission 
process. As mentioned in [26], it is reasonable to assume that $p_f$ is uncorrelated with the number of 
retransmissions. The other key probability is the cooperation probability $p_c$, defined as the probability that a station 
accepts a two-way cooperation request and transmits a packet successfully in the cooperation. We assume 
that the cooperation probability is independent of the backoff stage and of the value of the backoff-time counter, 
because the cooperation process is only loosely related to the backoff behavior. 

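Based on the previous discussion and inspired by prior work [26], we can model the transitions between the states 
$\{S_{i,j}\}$ as a discrete-time Markov chain. In our Markov chain, all of the non-zero state transition probabilities are: 

$$
\begin{aligned}
P\{S_{0,j} \mid S_{i,0}\} &= \frac{1-p_f}{W_0}, & 0 \le i \le m,\ 0 \le j \le W_0-1 \\
P\{S_{i+1,j} \mid S_{i,0}\} &= \frac{p_f}{W_{i+1}}, & 0 \le i \le m-1,\ 0 \le j \le W_{i+1}-1 \\
P\{S_{m,j} \mid S_{m,0}\} &= \frac{p_f}{W_m}, & 0 \le j \le W_m-1 \\
P\{S_{0,k} \mid S_{i,j}\} &= \frac{p_c}{W_0}, & 0 \le i \le m,\ 1 \le j \le W_i-1,\ 0 \le k \le W_0-1 \\
P\{S_{i,j-1} \mid S_{i,j}\} &= 1-p_c, & 0 \le i \le m,\ 1 \le j \le W_i-1
\end{aligned}
\tag{35}
$$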



The first three equations in (35) correspond to the backoff behavior after a collision or a successful transmission, 
which is similar to the CSMA/CA protocol. The last two equations, however, are specific to a network 
adopting the TREAN protocol. The fourth equation describes the state transition caused by being requested to 
participate in a two-way relay cooperation. Once a station agrees to cooperate in the two-way relay manner 
and transmits a data frame successfully in the cooperation, the station resets its backoff stage to zero and 
takes a random backoff, just as after a successful transmission under the CSMA/CA protocol. The last equation in 
(35) accounts for the fact that as long as a station is not involved in a successful two-way relay cooperation, 
it decrements its backoff counter when the channel is sensed idle for a certain period. 

To determine the stationary distribution $\{v_{i,j}\}$, we note that 

Based on the recursive relations in equation (36), we can show that 

Substituting equation (39) into equation (38) and choosing $j = 0$, we obtain new recursive equations for $v_{i,0}$: 



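$$
v_{i,0} =
\begin{cases}
\dfrac{\left[v_c + v_t(1-p_f)\right]\left[1-(1-p_c)^{W_0}\right]}{W_0\, p_c}, & i = 0 \\[2ex]
\dfrac{p_f\left[1-(1-p_c)^{W_i}\right]}{p_c W_i}\, v_{i-1,0}, & 1 \le i \le m-1 \\[2ex]
\dfrac{p_f\left[1-(1-p_c)^{W_m}\right]}{p_c W_m - p_f\left[1-(1-p_c)^{W_m}\right]}\, v_{m-1,0}, & i = m
\end{cases}
\tag{40}
$$

Therefore, for $0 \le i \le m-1$, $v_{i,0}$ can be expressed as 

$$
v_{i,0} = \frac{p_f^{\,i}\left[v_c + v_t(1-p_f)\right]}{p_c^{\,i+1}} \prod_{k=0}^{i} \frac{1-(1-p_c)^{W_k}}{W_k}
$$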

Where 



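Hence, we have that 

define 

$$
c(p_f,p_c) = \sum_{i=0}^{m-1} \frac{p_f^{\,i}}{p_c^{\,i+1}} \prod_{k=0}^{i} \frac{1-(1-p_c)^{W_k}}{W_k}
+ \frac{p_f^{\,m}}{p_c^{\,m}} \cdot \frac{\prod_{k=0}^{m} \frac{1-(1-p_c)^{W_k}}{W_k}}{p_c - p_f\,\frac{1-(1-p_c)^{W_m}}{W_m}}
$$

Then we have that 

$$ v_t = \left[v_c + v_t(1-p_f)\right] c(p_f,p_c) \tag{47} $$

According to the definitions of $v_c$ and $v_t$ and the last equation in (36), it can be shown that 

$$ v_t = \left[(1-v_t)\,p_c + v_t(1-p_f)\right] c(p_f,p_c) \tag{48} $$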

Moving $v_t$ to one side of the equality, we have that 

$$ v_t = \frac{c(p_f,p_c)\,p_c}{1 - c(p_f,p_c)\,(1-p_c-p_f)} \tag{49} $$

Hence the transmission probability, defined as the probability that a station initiates a transmission in a 
randomly chosen time slot, can be expressed as 

$$ p_t = \sum_{i=0}^{m} v_{i,0} = v_t = \frac{c(p_f,p_c)\,p_c}{1 - c(p_f,p_c)\,(1-p_c-p_f)} \tag{50} $$
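
The step from (48) to (49) is a short rearrangement that may be worth spelling out. Writing $c$ for $c(p_f,p_c)$ and taking (48) in the form $v_t = [(1-v_t)p_c + v_t(1-p_f)]\,c$, a worked version reads:

```latex
\begin{align*}
v_t &= \bigl[(1 - v_t)\,p_c + v_t\,(1 - p_f)\bigr]\,c \\
    &= c\,p_c + c\,(1 - p_c - p_f)\,v_t \\
v_t\,\bigl[1 - c\,(1 - p_c - p_f)\bigr] &= c\,p_c \\
v_t &= \frac{c\,p_c}{1 - c\,(1 - p_c - p_f)}
\end{align*}
```

The last line is the expression used for both $v_t$ and the transmission probability $p_t$.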

For the sake of simplicity, we consider a symmetric setting where every station has an equal opportunity 
to be requested to participate in a two-way relaying cooperation. In this case, we can assume that the 
cooperation probability $p_c$ is the same for all stations in the network. Also, as mentioned in [26], it is reasonable 
to assume that $p_f$ is the same for all stations. Furthermore, we note that the transmission probability is 
determined by the cooperation probability $p_c$ and the transmission failure probability $p_f$. Hence we can conclude 
that the transmission probability $p_t$ is also the same for all stations. 

Let $D_X$ denote the set of stations that have data frames with station X as their next two-hop destination, 
and let $T_X$ represent the set of stations that are next two-hop destinations of data frames from station X. Also, 
$p_{XY}$ stands for the probability that station X transmits an RTS frame with the address of station Y in the NA 
field, and $p'_{XY}$ denotes the probability that such an RTS frame is free from collision and the cooperation 
process is successful. Then for a specific station A, the cooperation probability is given by 

$$ p_c = \sum_{X \in D_A} p_{XA}\, p'_{XA} \tag{51} $$

We add the cooperation probabilities of all stations together; then we have that 

\begin{align}
\sum_{Y} p_c &= \sum_{Y} (1-p_t)^{n-2} \sum_{X \in D_Y} p_{XY} \tag{54} \\
&= (1-p_t)^{n-2} \sum_{Y} \sum_{X \in D_Y} p_{XY} \tag{55} \\
&= (1-p_t)^{n-2} \sum_{X} \sum_{Y \in T_X} p_{XY} \tag{56} \\
&= (1-p_t)^{n-2} \sum_{X} p_t \tag{57}
\end{align}

We know that $p_c$ and $p_t$ are the same for all stations, hence it can be shown that 

$$ p_c = p_t (1-p_t)^{n-2} \tag{58} $$

As derived in [26], the transmission failure probability can be expressed as 

$$ p_f = 1-(1-p_t)^{n-1} \tag{59} $$



Combining equations (50), (58) and (59), we can solve for $p_t$, $p_f$ and $p_c$ with numerical methods. Although the previous 
derivation is based on the symmetric setting assumption, we should emphasize that our theoretical framework 
is not limited by this requirement. In fact, it can be applied to various scenarios. However, asymmetric 
scenarios result in relatively high computational complexity. In the most general case, $p_t$, $p_f$ and $p_c$ are all 
different for different stations, and we have to solve $3n$ equations to obtain these probabilities for each station. 
In this case, more sophisticated numerical methods should be applied. For example, the secant updating 
methods can solve our problem at the cost of $O(n^2)$ operations [27]. 

Based on the previous results, the length of the generalized time slot $\lambda$, defined as the time interval between two 
consecutive backoff counter decrements in [35], can be calculated as 

$$ \lambda = P_{idle} T_{slot} + P_{col} T_c + P_{succ} T_s \tag{60} $$

where $P_{idle}$, $P_{succ}$ and $P_{col}$ represent the probabilities that the channel is idle, captured by a successful 
transmission, and occupied by a collision, respectively. Also, $T_{slot}$, $T_s$ and $T_c$ denote the length of a time 
slot, of a successful transmission period, and of a collision, respectively. These variables are given 
by 

$$
\begin{aligned}
P_{idle} &= (1-p_t)^{n} \\
P_{succ} &= n\,p_t (1-p_t)^{n-1} \\
P_{col} &= 1 - (1-p_t)^{n} - n\,p_t (1-p_t)^{n-1}
\end{aligned}
\tag{61}
$$

and 

$$
\begin{aligned}
T_s ={}& RTS + SIFS + \delta + RTC + SIFS + \delta \\
& + ATC + SIFS + \delta + CTS + SIFS + \delta \\
& + BDATA + SIFS + \delta + BDATA + SIFS + \delta \\
& + BACK + SIFS + \delta + BACK + SIFS + \delta \\
T_c ={}& RTS + DIFS + \delta
\end{aligned}
\tag{62}
$$

where $\delta$ denotes the propagation delay. Note that BDATA is slightly longer than DATA because the 
two data frames do not overlap completely without synchronization. However, this overhead is negligible and we 
can approximate BDATA by DATA. The same holds for BACK. 

In a successful two-way relay cooperation, two packets are received by their next two-hop destinations. 
This is equivalent to four transmissions in the traditional CSMA/CA scheme. Therefore the saturation 
throughput can be expressed as 

$$ \phi_s = \frac{4 P_{succ} E[P]}{\lambda} \tag{63} $$

where $E[P]$ represents the average payload of a packet, and $P_{succ}$ and $\lambda$ are obtained from the previous derivations. 
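
Once the transmission probability is known, the small-scale throughput is a direct evaluation of the channel-state probabilities, the generalized time slot and the throughput formula. The sketch below assumes that the transmission probability `p_t` has already been obtained numerically and that all durations share one time unit; the function name and the toy numbers in the example are illustrative assumptions, not values from the paper.

```python
def saturation_throughput(p_t, n, t_slot, t_s, t_c, mean_payload):
    """Evaluate the small-scale saturation throughput for a given p_t.

    p_t: per-slot transmission probability of a station.
    n: number of stations.
    t_slot, t_s, t_c: idle slot, successful cooperation and collision durations.
    mean_payload: average payload per packet, E[P].
    """
    p_idle = (1 - p_t) ** n
    p_succ = n * p_t * (1 - p_t) ** (n - 1)
    p_col = 1 - p_idle - p_succ
    slot = p_idle * t_slot + p_col * t_c + p_succ * t_s  # generalized time slot
    # One successful cooperation counts as four CSMA/CA transmissions.
    return 4 * p_succ * mean_payload / slot


# Toy numbers chosen only to make the arithmetic easy to follow.
phi = saturation_throughput(p_t=0.5, n=2, t_slot=1.0, t_s=4.0, t_c=2.0, mean_payload=11.0)
```

With these toy inputs the three channel-state probabilities are 0.25, 0.5 and 0.25, the generalized slot is 2.75 time units, and the result is 8 payload units per time unit.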



4.2 Saturation Throughput in large-scale network 

Due to the complexity of a large-scale network with the hidden node problem, it is difficult to provide an exact 
derivation of the saturation throughput. Instead, we present an approximate analysis of the throughput 
performance of this type of network in this subsection. Although several secondary factors are neglected and some 
assumptions are made, we capture the most important elements that play a significant role in the performance 
of the network. Therefore our approximate result can closely reflect the performance of the TREAN protocol 
and provide a guideline for protocol design. 

First of all, we determine the transmission failure probability $p_f$, defined as the probability that a station fails to receive the 
ACK for its data frame. It should be noted that a transmission failure does not necessarily mean that 
the transmission of the payload bits failed. It is possible that the data frames are received correctly by the stations but 
the BACK frames collide due to hidden stations. In this case, the station increases its backoff stage and 
takes a random backoff but buffers the previous data frame, as explained in section 3.1. 

Transmission failure has two causes in a network with hidden stations. One of them is 
the collision of the RTS frame. This happens when a station transmits an RTS frame and, in the same time slot, 
another station in the interference range of the receiver also begins its transmission. Hence the probability 
of transmission failure due to RTS collision is given by 

$$ p_{f1} = 1-(1-p_t)^{n_i-1} \tag{64} $$

where $n_i$ denotes the number of stations in the interference range of a station. If the stations are uniformly 
distributed with density $\Lambda$ and the interference range is given by $r_i$, then $n_i$ can be expressed as 

$$ n_i = r_i^2 \pi \Lambda \tag{65} $$

Besides RTS collisions, transmission failure may happen due to the existence of hidden stations. In 
the multiple access phases of the two-way relay cooperation process, as shown in Figure 13, stations in 
the interference range of the relay node can sense at least one ongoing transmission, and hence no hidden 
region exists. In the broadcast phases of the cooperation process, corresponding to the transmission of RTC, 
CTS, BDATA and BACK frames, the transmitter is hidden from two regions, indicated by the shadow areas 
in Figure 14. If stations in the two shadow regions begin to transmit while the broadcast is underway, 
the corresponding stations suffer collisions. The vulnerable period $T_v$ for the collision can 
be calculated for the different scenarios. As shown in Figure 14, if the broadcast stage follows a 
transmission from station A (or station C), the stations in the corresponding hidden region have to wait at least 
a DIFS period after the end of that transmission. Hence the vulnerable period for station A (or station C) is 
given by 

$$ T_v = T_{packet} - (DIFS - SIFS) \tag{66} $$

where $T_{packet}$ is the length of the broadcast frame from station B. However, if there is no transmission from station A 
(or station C) before the broadcast stage, the vulnerable period is simply 

$$ T_v = T_{packet} \tag{67} $$
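
The two cases of the vulnerable period can be captured in one helper. This is a small illustrative sketch; the function name, the boolean flag and the example durations (in microseconds) are assumptions.

```python
def vulnerable_period(t_packet, difs, sifs, follows_endpoint_transmission):
    """Vulnerable period of an end station during a broadcast from the relay.

    follows_endpoint_transmission: True if the broadcast directly follows a
    transmission from that end station, so hidden stations must first wait DIFS.
    """
    if follows_endpoint_transmission:
        return t_packet - (difs - sifs)
    return t_packet


# Assumed example values: a 300 us broadcast frame, DIFS = 50 us, SIFS = 10 us.
t_v_protected = vulnerable_period(300, 50, 10, follows_endpoint_transmission=True)
t_v_exposed = vulnerable_period(300, 50, 10, follows_endpoint_transmission=False)
```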

Let $S_A$ and $S_C$ denote the hidden regions corresponding to station A and station C, and let $n_{S_i}$ represent the 
number of stations in region $S_i$ ($i \in \{A, C\}$). Based on the previous discussion, we can determine the vulnerable 
period $T_{v,j}$ for station A or station C at broadcast stage $j$ ($j \in \{RTC, CTS, BDATA, BACK\}$). Then the 
probability that no collision caused by a transmission from stations in hidden region $S_i$ happens at 
station $i$ in broadcast stage $j$ is given by 

$$ (1-p_t)^{\,n_{S_i} \left\lceil T_{v,j}/\lambda \right\rceil} \tag{68} $$

where $\lambda$ is the generalized time slot defined in subsection 4.1. 

Figure 13: The multiple access phase in the cooperation process. In this phase, station A and station C 
send messages to the relay station B. Solid circles represent the ranges within which the corresponding transmitters can 
be sensed, and the dashed circle denotes the interference range of the receiver. 

Figure 14: The broadcast phase in the cooperation process. In this phase, the relay station B broadcasts 
information to station A and station C. The solid circle represents the range within which station B can be sensed, 
and the dashed circles denote the interference ranges of the receivers. Shadow areas are hidden regions. 

In fact, a collision may also happen due to a transmission initiated by stations outside the hidden 
regions defined previously, as shown in Figure 15. However, the requirements for this 
scenario are demanding. First, station D must be unable to sense the transmissions of station A, station B 
and station C, and must initiate the cooperation (i.e. transmit an RTS) successfully. Second, the next two-hop 
destination, station F, of the packet from station D must happen to lie in the interference range of 
station A, station B or station C. Third, to respond to the cooperation request and transmit messages, station F has 
to receive packets from station E correctly, i.e. it must happen to avoid the ongoing transmissions from 
station A, station B or station C. Therefore this situation does not happen frequently, and neglecting it 
has little impact on the final result. 

Figure 15: Pierced interference. Station D, outside the hidden regions defined previously, may initiate 
a two-way relay cooperation via station E with station F, which is in the interference region of station C. 
Hence the transmission of station F may result in a collision at station C. We call this type of collision 
pierced interference. 

Based on the previous discussion, if we use $n_h$ (as shown in Figure 16) to approximate the number of 
stations in one hidden region, the probability of transmission failure due to the hidden node problem can 
be approximated as 

$$ p_{f2} = 1-(1-p_t)^{n_1} \tag{69} $$

where 

$$ n_1 = 2\left( \sum_{j \in \{RTC,\,CTS,\,BDATA\}} n_h \left\lceil \frac{T_{v,j}}{\lambda} \right\rceil \right) + n_h \left\lceil \frac{T_{v,BACK}}{\lambda} \right\rceil \tag{70} $$

Hence the transmission failure probability is given by 

\begin{align}
p_f &= 1-(1-p_{f1})(1-p_{f2}) \tag{71} \\
&= 1-(1-p_t)^{n_i+n_1-1} \tag{72}
\end{align}

Figure 16: Hidden region. Let $r_s$, $r_i$ and $r_c$ denote the sensing range, the interference range and the 
communication range respectively; $n_h$ represents the number of stations in the shadow region. 



Similar to the derivation of equation (58) for the small-scale network case, the cooperation probability can
be expressed as

Pc = Pt(l - Pt) ni -' 2+ni (73) 

To find the value of the generalized time slot $X$, we consider the channel around station A. Let $Z_A$ denote the set of stations in the sensing range of station A and let $n_s$ represent the number of elements in $Z_A$. Then the probability that the channel is idle can be expressed as

$$p_{\mathrm{idle}} = (1 - p_t)^{n_s} \tag{74}$$

If $A_i$ denotes the event that station $i$ transmits an RTS frame successfully, it can be shown that

$$P[A_i] = p_t (1 - p_t)^{n_s - 1} \tag{75}$$

Then the probability that at least one RTS frame is successfully transmitted is given by

$$p_{\mathrm{RTSsucc}} = P\left[\bigcup_{i \in Z_A} A_i\right] \tag{76}$$



We should note that two or more RTS frames may be successfully transmitted concurrently within the sensing range of station A. However, this only happens when the transmitters of these RTS frames are sufficiently separated that collisions are avoided. As explained in Figure 17, we take the separation requirement for concurrent successful RTS transmissions to be that the transmitters are more than $(r_i + r_c)$ apart from each other. Let $p_{2c}$ represent the probability that the distance between two randomly chosen points in the disk with radius $r_s$ is greater than $(r_i + r_c)$. Let $p_{3c}$ represent the probability that the distances between three randomly chosen points in the disk with radius $r_s$ are all greater than $(r_i + r_c)$.

Figure 17: Concurrent RTS transmission. Consider that $A_1$ and $A_2$ transmit RTS frames concurrently to $B_1$ and $B_2$ respectively. If the distance $d$ between the two transmitters is greater than $(r_i + r_c)$, then by the triangle inequality the distance between $A_1$ and $B_2$ is greater than $d$ minus the distance between $A_2$ and $B_2$. Since $B_2$ is within the communication range of $A_2$, the latter distance is less than $r_c$. Hence the distance between $A_1$ and $B_2$ is greater than $r_i$, and there is no interference at station $B_2$. The same holds for station $B_1$.

Then it can be shown that

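\begin{align}
p_{\mathrm{RTSsucc}} &= P\left[\bigcup_{i \in Z_A} A_i\right] \tag{77}\\
&= \sum_{i} P[A_i] - \sum_{i<j} P[A_i \cap A_j] + \sum_{i<j<k} P[A_i \cap A_j \cap A_k] \tag{78}\\
&\approx \sum_{i} P[A_i] - \sum_{\substack{i<j \\ d(i,j) > r_c + r_i}} P[A_i \cap A_j] + \sum_{\substack{i<j<k \\ d(i,j) > r_c + r_i \\ d(i,k) > r_c + r_i \\ d(k,j) > r_c + r_i}} P[A_i \cap A_j \cap A_k] \tag{79}\\
&\approx n_s p_t (1 - p_t)^{n_s - 1} - \binom{n_s}{2} p_{2c}\, p_t^2 (1 - p_t)^{2 n_s - 2} + \binom{n_s}{3} p_{3c}\, p_t^3 (1 - p_t)^{3 n_s - 3} \tag{80}
\end{align}

where

$$p_{2c} = \iint_{d(x,y) > r_c + r_i} \left(\frac{1}{\pi r_s^2}\right)^2 dx\, dy, \qquad
p_{3c} = \iiint_{\substack{d(x,y) > r_c + r_i \\ d(y,z) > r_c + r_i \\ d(z,x) > r_c + r_i}} \left(\frac{1}{\pi r_s^2}\right)^3 dx\, dy\, dz \tag{81}$$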
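The probabilities $p_{2c}$ and $p_{3c}$ are purely geometric quantities, so besides evaluating the integrals they can be estimated by Monte Carlo sampling. The following Python sketch assumes that the points are uniformly distributed over the disk of radius $r_s$; the function names are hypothetical, and the ranges are the normalized values quoted in the simulation section ($r_c = 1$, $r_i = 1.78$, $r_s = 2.9$), used here only as an example.

```python
import math
import random


def random_point_in_disk(rng, radius):
    # Uniform sampling over the disk area: the radial coordinate needs a square root.
    r = radius * math.sqrt(rng.random())
    theta = 2 * math.pi * rng.random()
    return r * math.cos(theta), r * math.sin(theta)


def separation_probability(k, r_s, min_dist, trials=100_000, seed=0):
    """Estimate P(all pairwise distances among k uniform points exceed min_dist)."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        pts = [random_point_in_disk(rng, r_s) for _ in range(k)]
        if all(math.dist(pts[a], pts[b]) > min_dist
               for a in range(k) for b in range(a + 1, k)):
            hits += 1
    return hits / trials


# Example values only (normalized ranges from the simulation section).
r_c, r_i, r_s = 1.0, 1.78, 2.9
p_2c = separation_probability(2, r_s, r_c + r_i)
p_3c = separation_probability(3, r_s, r_c + r_i)
print(f"p_2c ~ {p_2c:.3f}, p_3c ~ {p_3c:.3f}")
```

Because every additional point adds further separation constraints, the estimate of $p_{3c}$ should come out smaller than that of $p_{2c}$, which is a quick sanity check on the sampling.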

Then the probability that all RTS frames from stations in $Z_A$ collide is given by

$$p_{\mathrm{RTScol}} = 1 - p_{\mathrm{idle}} - p_{\mathrm{RTSsucc}} \tag{82}$$

Let $T_{\mathrm{RTSsucc}}$ denote the channel busy time given that at least one RTS frame from the stations in $Z_A$ is successfully transmitted. Then the generalized time slot can be expressed as

$$X = p_{\mathrm{idle}} T_{\mathrm{slot}} + p_{\mathrm{RTScol}} T_{\mathrm{col}} + p_{\mathrm{RTSsucc}} E[T_{\mathrm{RTSsucc}}] \tag{83}$$
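As a rough illustration, the chain from equations (74) and (75) through the approximation (80) to equations (82) and (83) can be evaluated numerically as follows. This is only a sketch: the function name is hypothetical, and all input values in the usage example (attempt probability, station count, separation probabilities and durations in microseconds) are assumed for illustration rather than taken from the paper.

```python
from math import comb


def generalized_slot(p_t, n_s, p_2c, p_3c, t_slot, t_col, t_rts_succ):
    """Return (p_idle, p_rts_succ, p_rts_col, X) for the given inputs."""
    p_idle = (1 - p_t) ** n_s                    # eq. (74)
    p_single = p_t * (1 - p_t) ** (n_s - 1)      # eq. (75)
    p_succ = (n_s * p_single
              - comb(n_s, 2) * p_2c * p_single ** 2
              + comb(n_s, 3) * p_3c * p_single ** 3)   # approximation (80)
    p_col = 1 - p_idle - p_succ                  # eq. (82)
    x = p_idle * t_slot + p_col * t_col + p_succ * t_rts_succ   # eq. (83)
    return p_idle, p_succ, p_col, x


# Illustrative inputs only; durations are in microseconds.
p_idle, p_succ, p_col, x = generalized_slot(
    p_t=0.01, n_s=80, p_2c=0.4, p_3c=0.05,
    t_slot=9.0, t_col=60.0, t_rts_succ=400.0)
print(f"p_idle={p_idle:.4f} p_succ={p_succ:.4f} p_col={p_col:.4f} X={x:.1f} us")
```

The three probabilities sum to one by construction, so $X$ is a weighted average of the three durations.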

As discussed previously, a transmission failure due to hidden stations may happen at every stage of the two-way relay cooperation after a successful RTS transmission. It is therefore complicated to provide an exact expression for $E[T_{\mathrm{RTSsucc}}]$. Hence we take the average of the shortest channel busy time $T_{\mathrm{short}}$ and the longest channel busy time $T_{\mathrm{long}}$ as an approximation of $E[T_{\mathrm{RTSsucc}}]$. Given that an RTS frame is successfully transmitted, the shortest channel busy time $T_{\mathrm{short}}$ corresponds to the case in which the RTC frame collides, while a successful cooperation results in the longest channel busy time $T_{\mathrm{long}}$. Hence we have

$$T_{\mathrm{short}} = RTS + SIFS + RTC + DIFS, \qquad T_{\mathrm{long}} = T_s \tag{84}$$

Based on equations (71), (73), (83) and (50), we can solve for the transmission failure probability $p_f$ with numerical techniques. With this knowledge, the probability $p_{s2}$ that both data frames are correctly received by the corresponding stations in one round of the two-way relay cooperation process, and the probability $p_{s1}$ that only one data frame is successfully received, can be calculated as



Ps2 



(1 - p t ) ni " 1+n2 (1 ~ PtT h [ ^ 1 (85) 

p sl = (1 - Pt )"'~ 1+ " 2 (2 - 2(1 - Pt )^r ^^ l) (86) 

where 

= £ n h \^f])+n h \^^] (87) 

\je{RTC,CTS} J 
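The numerical solution for $p_f$ mentioned above can be illustrated with a small root-finding sketch. Equation (50) is not reproduced in this part of the paper, so the sketch below substitutes the well-known Bianchi attempt-probability formula for it and collapses the contending and hidden stations into a single hypothetical parameter `n_eff`. The names `w_min`, `max_stage` and `n_eff` and their values are illustrative assumptions, not quantities from the paper.

```python
def attempt_probability(p, w_min=16, max_stage=6):
    """Bianchi-style attempt probability; a stand-in for eq. (50), not the paper's formula."""
    # Equivalent to 2(1-2p) / ((1-2p)(W+1) + pW(1-(2p)^m)) but well defined at p = 0.5.
    geometric = sum((2 * p) ** k for k in range(max_stage))
    return 2.0 / (1 + w_min + p * w_min * geometric)


def failure_probability(p_t, n_eff):
    """Probability that at least one of the other n_eff - 1 stations transmits."""
    return 1 - (1 - p_t) ** (n_eff - 1)


def solve_failure_probability(n_eff, tol=1e-12):
    """Bisection on g(p) = p - failure_probability(attempt_probability(p))."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        residual = mid - failure_probability(attempt_probability(mid), n_eff)
        if residual < 0:
            lo = mid
        else:
            hi = mid
    p_f = (lo + hi) / 2
    return p_f, attempt_probability(p_f)


p_f, p_t = solve_failure_probability(n_eff=10)
print(f"p_f = {p_f:.4f}, p_t = {p_t:.4f}")
```

Bisection is used here because the residual is monotonically increasing in $p$ for this stand-in model, which guarantees a unique root in $(0, 1)$; a damped fixed-point iteration would be an equally reasonable choice.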

Therefore, the saturation throughput for the large-scale network is given by

$$S = \frac{N p_t (4 p_{s2} + 2 p_{s1}) E[P]}{X}$$

where $p_t$, $p_{s1}$, $p_{s2}$ and $X$ can be obtained from the previous derivations.

5 Performance Evaluation



To validate both our physical layer decoding algorithm and our MAC protocol design, we perform simulations
in MATLAB in this section. We first verify the performance of our physical layer decoding
algorithm. Then we evaluate the performance of the TREAN protocol in various scenarios.

5.1 The BER performance of our physical layer decoding algorithm 




Figure 18: Decoding performance in AWGN channel when BPSK modulation is adopted. The dashed line 
represents the BER performance of standard maximum likelihood decoding when only the desired packet is 
received. The solid line denotes the BER performance of our physical layer decoding algorithm when the 
desired packet is received with asynchronous interference from a known packet. 

Figure 18 compares the BER performance of standard ML decoding and our physical layer decoding algorithm in an AWGN channel. It can be observed that the decoding performance of our algorithm is very close to that of standard ML decoding. This demonstrates that our decoding algorithm does not incur any performance loss even when the desired packet is received with asynchronous interference from a known packet. In fact, the performance of our decoding algorithm is slightly better than that of standard ML decoding, owing to the diversity gain introduced by oversampling.
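For reference, the interference-free baseline in this comparison (ML detection of BPSK in AWGN) has the well-known closed form $\mathrm{BER} = \tfrac{1}{2}\,\mathrm{erfc}\big(\sqrt{E_b/N_0}\big)$. The Python sketch below evaluates this expression and checks it against a simple Monte Carlo simulation. It covers only the single-packet baseline, not the proposed decoding algorithm with asynchronous interference, and the function names, bit count and seed are assumptions chosen for illustration.

```python
import math
import random


def bpsk_ber_theory(ebn0_db):
    """Closed-form BER of coherent BPSK in AWGN."""
    ebn0 = 10 ** (ebn0_db / 10)
    return 0.5 * math.erfc(math.sqrt(ebn0))


def bpsk_ber_sim(ebn0_db, n_bits=200_000, seed=1):
    """Monte Carlo BER with unit-energy symbols and a sign detector."""
    rng = random.Random(seed)
    ebn0 = 10 ** (ebn0_db / 10)
    sigma = math.sqrt(1 / (2 * ebn0))   # noise variance N0/2 with Eb = 1
    errors = 0
    for _ in range(n_bits):
        bit = rng.getrandbits(1)
        symbol = 1.0 if bit else -1.0
        received = symbol + rng.gauss(0.0, sigma)
        if (received > 0) != bool(bit):
            errors += 1
    return errors / n_bits


for snr_db in (0, 2, 4):
    print(snr_db, bpsk_ber_theory(snr_db), bpsk_ber_sim(snr_db))
```

At moderate $E_b/N_0$ the simulated and theoretical values should agree to within the statistical error of the simulation.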

5.2 The performance of TREAN protocol 

In this subsection, we evaluate the performance of the TREAN protocol in various settings. The parameters used in the simulation, except for the lengths of the control frames in the TREAN protocol, which are presented in Table 1, are summarized in Table 2. All values given in the table are the same as those specified in the 802.11 standard [24] and the 802.11a amendment [25]. Unless explicitly stated otherwise, stations are uniformly distributed in the network in our simulations.

5.2.1 Throughput performance of TREAN protocol in small-scale network 

Figure 19 illustrates the throughput performance of the TREAN protocol in a small-scale network as the number of stations in the network increases. We can observe that the performance gain of the TREAN protocol over the traditional CSMA/CA protocol is more than one hundred percent. The main contribution comes from the enhanced spectral efficiency of the two-way relay technique. The technique allows two concurrent transmissions instead of one in the traditional scheme, and hence doubles the throughput of the network. An extra benefit of the TREAN protocol comes from its more compact transmission manner. In one contention period, the equivalent of four data transmissions is performed in the TREAN protocol, compared to only one in the CSMA/CA scheme. Hence each data transmission in the TREAN protocol bears one fourth of the contention overhead and one fourth of the backoff overhead, whereas each data transmission in the CSMA/CA protocol bears the full contention overhead and the full backoff overhead.

Parameter            Value
packet payload       1023 bytes
MAC header           34 bytes
PHY header           20 μs
ACK (CSMA/CA)        14 bytes + PHY header
RTS (CSMA/CA)        20 bytes + PHY header
CTS (CSMA/CA)        14 bytes + PHY header
Channel Rate         54 Mbps
Propagation Delay    ≈ 1 μs
Slot time            9 μs
SIFS                 16 μs
DIFS                 34 μs

Table 2: Parameters used in the simulation

[Figure 19, two panels plotted against the number of stations (5 to 50): (a) Saturation throughput of TREAN; (b) Saturation throughput of CSMA/CA]

Figure 19: Throughput performance of the TREAN protocol in a small-scale network. The symbols denote simulation results, which are the average values of 30 repeated experiments, while the solid lines present values calculated from the theoretical equations.

The figure also compares the analytic results with the simulation results. It shows that our theoretical analysis accurately predicts the performance of the TREAN protocol: the error is always less than one percent. This validates our theoretical derivation, and hence we can use this analysis tool to further study the performance of the TREAN protocol and to optimize parameters such as the initial contention window size and the number of stations deployed in the network in practical designs.

When perfect scheduling is adopted, the saturation throughput of the network with the two-way relaying technique can approach 86% of the ideal rate of 108 Mbps. The fourteen percent performance loss is introduced by the MAC layer header and the PHY layer header added to the payload bits. With the random access MAC protocol TREAN, the saturation throughput is about 45% of the ideal rate. It may seem that applying two-way relaying in a scalable manner comes at the cost of performance degradation. However, the same problem exists in the traditional CSMA/CA protocol: the saturation throughput of the network with the 802.11 MAC layer is only 41% of the ideal rate of 54 Mbps, compared to 86% in the perfect scheduling case. Therefore the performance degradation is due to the inefficiency of the CSMA/CA protocol itself (note that the TREAN protocol is based on the CSMA/CA protocol). Several research works have addressed this issue. In [30], the efficiency of the CSMA/CA protocol is boosted to 80% at the rate of 54 Mbps. If we incorporated the sophisticated techniques of [30] into the TREAN protocol, the efficiency of our protocol would approach that of the perfect scheduling solution. However, this is outside the scope of this paper.
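The 86% figure can be checked with a short calculation from the parameters in Table 2. The sketch below assumes that, under perfect scheduling, the only overhead per frame is one MAC header and one PHY header, with no contention, backoff or inter-frame spacing; this accounting and the function name are assumptions made for illustration. The last two lines simply convert the quoted percentages into absolute rates.

```python
def frame_efficiency(payload_bytes=1023, mac_header_bytes=34,
                     phy_header_us=20.0, rate_mbps=54.0):
    """Fraction of airtime spent on payload bits (bits / Mbps gives microseconds)."""
    payload_us = payload_bytes * 8 / rate_mbps
    mac_us = mac_header_bytes * 8 / rate_mbps
    return payload_us / (payload_us + mac_us + phy_header_us)


eff = frame_efficiency()
print(f"header-limited efficiency: {eff:.3f}")            # about 0.858
print(f"TREAN at 45% of 108 Mbps: {0.45 * 108:.1f} Mbps")
print(f"CSMA/CA at 41% of 54 Mbps: {0.41 * 54:.1f} Mbps")
```

Under these assumptions the payload occupies roughly 151.6 μs out of about 176.6 μs per frame, which is consistent with the 86% quoted for perfect scheduling.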

5.2.2 Throughput performance of TREAN protocol in large-scale network

Figure 20: Throughput performance of the TREAN protocol in a large-scale network. The symbols denote simulation results, which are the average values of 15 repeated experiments, while the dashed line presents values calculated from the analytic equations.

Figure 20 shows the throughput performance of the TREAN protocol in a large-scale network as the sensing range increases. The results are obtained for a network that contains three hundred stations and has size 10 × 10, with the communication radius normalized to 1. The interference range is set to 1.78 according to [29].
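To give a feel for this setting, the following Python sketch draws a uniform random topology with the stated parameters and counts how many stations fall inside the sensing range of a station on average. The sensing range of 2.9 is used purely as an example value, the function names are hypothetical, and the density-based estimate ignores border effects, so it overestimates the count obtained in a finite 10 × 10 area.

```python
import math
import random


def expected_neighbors(n_stations, side, radius):
    """Density-based estimate of stations within `radius`, ignoring borders."""
    density = n_stations / side ** 2
    return density * math.pi * radius ** 2


def mean_neighbors(n_stations=300, side=10.0, radius=2.9, seed=0):
    """Average number of other stations within `radius` in one random topology."""
    rng = random.Random(seed)
    pts = [(rng.uniform(0, side), rng.uniform(0, side)) for _ in range(n_stations)]
    total = 0
    for i, (xi, yi) in enumerate(pts):
        for j, (xj, yj) in enumerate(pts):
            if i != j and math.hypot(xi - xj, yi - yj) <= radius:
                total += 1
    return total / n_stations


estimate = expected_neighbors(300, 10.0, 2.9)
simulated = mean_neighbors()
print(f"density estimate: {estimate:.1f}, simulated mean: {simulated:.1f}")
```

The gap between the two numbers indicates how strongly stations near the border see fewer neighbors than the density alone suggests.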

It can be observed that the throughput gain of the TREAN protocol over the CSMA/CA scheme is about one hundred percent. This is partially due to the more compact spatial reuse of the TREAN protocol: two-way relaying allows two concurrent transmissions that are quite close to each other, which is impossible in the CSMA/CA protocol. Hence, on average, more concurrent transmissions can coexist under the TREAN protocol than under the CSMA/CA protocol. The reduced contention and backoff overhead per data transmission mentioned previously also contributes to the performance enhancement. This contribution is more significant in a large-scale network than in a small-scale network, because the backoff time slots of different stations are not synchronized with each other: different stations have different sensing regions in a large-scale network and hence obtain different sensing results. Therefore stations do not access the channel in a slotted manner in a large-scale network, which results in more frequent RTS collisions and hence more contention overhead.

In addition, similar to the CSMA/CA case, the throughput of the network under the TREAN protocol first goes up and then declines as the sensing range increases. The initial increase of the throughput is due to the notable decrease in collisions caused by hidden stations, while the decline in performance when the sensing range increases further is caused by more conservative spatial reuse. We also note that the optimal sensing range of the TREAN protocol is larger than that of the CSMA/CA protocol. This is because the structure unit of the physical layer under the TREAN protocol is larger and more complicated than that of the traditional physical layer, and hence suffers from a more serious hidden node problem. Therefore a larger sensing range is needed to diminish the effect of hidden stations in the TREAN protocol.

Figure 20 also shows the comparison between the theoretical throughput and the simulation results. It can be observed that our theoretical results closely reflect the trend of the throughput as the sensing range increases, and the error relative to the simulation results is always less than five percent. Hence our approximate derivation of the saturation throughput of the large-scale network is sufficient to give a preliminary estimate of the performance of the TREAN protocol and can provide insights and guidelines for the design of realistic systems.

5.2.3 Extended Modes of TREAN protocol

Figure 21: Throughput performance of different modes of TREAN protocol

Figure 21 shows the throughput of the network when running different modes of the TREAN protocol. In this simulation, the network is a small-scale one with forty stations. The extended modes are implemented with two EA fields in an extended RTC frame, and the management overhead is not taken into consideration. As expected, the extended modes perform much better than the basic mode of the TREAN protocol in the low ATC-probability region. In addition, when the ATC-probability is zero, the different modes of the TREAN protocol have the same performance, which is about twenty percent worse than that of the CSMA/CA protocol. This is partially due to the larger overhead of the control frames in the TREAN protocol. Also, in the zero ATC-probability scenario, only one-way relay transmission is performed. This scheme is even more vulnerable to the hidden node problem and hence degrades the performance of the TREAN protocol.

5.2.4 CPP frame in TREAN protocol 




Figure 22: A network topology that is prone to the fairness problem. There are six stations A, B, C, D, E and F in the network. The distances AB, BC, DE and EF are all 0.8 and the distance BE is 1.2, with the communication range set to 1. We assume that station A and station C have data frames to exchange, as do station D and station F.

Figure 23: The comparison between the protocol with CPP and without CPP

Figure 23 reveals the significance of the CPP frame for protecting the captured channel. The simulation is run in a network that is prone to the fairness problem, as shown in Figure 22. The interference range is set to 1.78 as in the previous simulation, and the sensing range is equal to 2.9, which is the optimal value for the TREAN protocol. We can observe that when the CPP frame is present, the throughput of group 1 is comparable to that of group 2. However, if we disable the CPP frame in the protocol, the throughput of group 1 degrades dramatically while the total throughput remains unchanged. This indicates that, without the protection of the CPP frame, the channel obtained by station D is recaptured by station A or station C. We also note that the situation is worse in the low communication rate scenario. This is because a lower rate results in longer control frames and hence lengthens the vulnerable period during which the channel may be recaptured.

5.2.5 ACK loss rate 



sensing range    2.2      2.4      2.6     2.8     3.0
ACK loss rate    15.12%   12.98%   6.39%   3.82%   1.59%

Table 3: ACK loss rate



Table 3 summarizes the ACK loss rate for different sensing ranges in a large-scale network with the TREAN protocol. It can be seen that the ACK loss rate decreases as the sensing range increases. This is because the hidden node problem, which causes the loss of ACK frames, is mitigated when the sensing range increases.

With our ACK scheme, a correctly received data frame is not acknowledged only when all three ACK frames carrying the ID of this data frame are lost. The probability of this event is much lower than one percent even when the sensing range is only 2.2 times the communication range, and hence the negative impact of ACK loss on the performance is almost negligible.
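The claim can be checked with a short calculation on the values in Table 3. The sketch below assumes that the three ACK frames are lost independently, each with the tabulated loss rate; this independence is an assumption made here for illustration, and the function name is hypothetical.

```python
# ACK loss rates from Table 3, keyed by sensing range.
ack_loss = {2.2: 0.1512, 2.4: 0.1298, 2.6: 0.0639, 2.8: 0.0382, 3.0: 0.0159}


def all_acks_lost(p_loss, copies=3):
    """Probability that every one of `copies` independent ACK frames is lost."""
    return p_loss ** copies


for sensing_range, p in ack_loss.items():
    print(f"sensing range {sensing_range}: "
          f"P(all 3 ACKs lost) = {all_acks_lost(p):.4%}")
```

For the worst case in the table (sensing range 2.2), this gives $0.1512^3 \approx 0.35\%$, which is indeed well below one percent.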

6 Conclusion 

In this paper, we propose a complete, practical and high-performance solution for applying two-way relaying in general ad hoc networks. The solution includes a new physical layer scheme for two-way relaying and a random access MAC protocol, TREAN. The new physical layer scheme does not require any synchronization and is feasible for any linear modulation scheme. This lays a solid foundation for the design of a widely applicable random access MAC protocol. On top of this physical layer scheme, an 802.11-like protocol, TREAN, is proposed. The TREAN protocol supports the two-way relaying technique to boost the performance of the network while avoiding complicated scheduling and optimization. Hence it can be adopted in large-scale networks. Furthermore, to guarantee the performance gain even when bi-directional data flows are absent, extended modes of the TREAN protocol are suggested. Simulation results and theoretical analysis both show that our integrated solution provides a remarkable performance improvement compared to the traditional CSMA/CA protocol.

The implementation of our solution requires changes to the physical chip due to the redesign of the physical layer. However, our physical layer scheme involves only common operations such as correlation, matrix computation and interpolation, all of which can be implemented with a standard DSP module. In addition, the TREAN protocol is based on the standard CSMA/CA protocol and hence can be obtained by modifying existing systems. The detailed implementation of our solution is left as future work.